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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W— I ryptopnan, Y= tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7283 


21184 


A 


7343 


306 


164 


IYMLYFIYITLIYIHTHTHTHLYLDKYI 
YL * CVHCFD * * AYM I CHQR 


7284 


21 185 


A 


7344 


442 


8 


&Qm!sKLt}S.l RAVAHAWi>Jir'£> X JLiGGQGRR 
IT* VRE FET SLGNTARPNL YWN 


7285 


21186 


A 


7345 


3 


248 


HELSLSLLLSPFFFKNEKGSR*VAHAGL 
KL IiTS S L> PP T * AS QS AR I TG I SHCAR PL 
LSHFLAYLALPCHQNRLCTGHLPST 


7286 


21187 


A 


7346 


2 


246 


HELYQS ILLiSPIiFFKNDIGSR* IAHAGL 

QLiIjT* sdppt*asqsaritgishcarpl 

LSHCLAYLALTCHQNRLCTGHLPST 


7287 


21188 


A 


7347 


1 


130 


GTRLLKRLTQETRLNPGGRGCREPSLHY 
CTPTWARQSQTVAHN* KRLTQETRLNPG 
GRGCREPSLHYCTPTWARQSQTVAHN 


7288 


21189 


A 


7348 


516 


288 


FFETESRFITQTGVQWCDLGPLQPLPPG 
FKGFSCIjSLLSS *dyrclpphpanfci f 
SRDGVS PC *LGWSQTPDLK 


7289 


21190 


A 


7349 


417 


148 


STS VWQAGVQWHDHGLLLPGPPRLK* S 
SHLSLPHG*DYRHILSCLANF*NIFVEA 
GFRHTAUGLiiiPIjGSSDLiPASASQSAGJT 
GMSHLA 


7290 


21191 


A 


7350 


143 


2 


TQ KFKT S Ij S S WD YRR V S AH V ANF C I FS K 
DRVSPY* PGWSRTPDPARA 


7291 


21192 


A 


7351 


189 


37 


EAIAWRII*NLFFFFFF*RWGLTVLPK 
LVSNSWAQAILSPQPSKVLRLQA 


7292 


21 193 


A 


7352 


131 


2 


KS RLGT V AHACNP S TLGGRHGW IT* SRE 
FKTSRANMVKPRPRA 


7293 


21194 


A 


7353 


1 


106 


GTRGQCLTML PRLVLN YWAQM I L P P WP P 
*VLGLQV 


7294 


21195 


A 


7354 


939 


653 


KGNFI FLRQS FTL VAQAG VQ WRDLGS P Q 
PPPPGSK*FSCLSLRSSWDYRHGPPHPA 
NFAFLVEMGFLHVGQAGLELPT*GDPPT 


7295 


21196 


A 


7355 


270 


26 


EARFLGLLRDI AQLCYPGWSAAV* S *LT 

T\ TV OrTip TVT 7\ <— \T 71/71 C rpTTT C T niVTJVT + T"\ ■> 7\ T~» 7V miVir 

AAb i b W Ay V KAb lHijbJjP]NJN*D*ARAJ;iyi 
PG * LKKYVF F * RQGLA I LPRLRLVF 


7296 


21197 


A 


7356 


329 


206 


HHIiANF * T F CRDRGLTMLAR LNS WP QA I 
liLPQPPKYLGLQM 


7297 


21198 


A 


7357 


128 


253 


MRPDT VAHTCNP S TLGGQGGR I T * THEF 

TjimnT f it^tujiwit./'ttvt ~\r~rr 
iL x S IjGJJMM K. P YLi YK 


7298 


21199 


A 


7358 


2 


108 


AREGSGGQGCSEP * WRHGTAAWVTEQDP 
VSKTIKK 


7299 


21200 


A 


7359 


1 


340 


GTRERNWHIiA * ADHTHTHTHTHRHTHTH 
THFSFPANS *RDTFVI F* QSLLLE IMTS 
FSPSLGLKCSFSQQPSGIjLSSIAGERDS 
R I HGENT VS M CLHNRL KAELRS D V I S LR 
V 


7300 


21201 


A 


7360 


385 


1 


SVGPSKPCAGYNLLVCHLjDRPLEKRSIR 
VGVT* FSRCHLSPLSLTRKANSLTPCTS 
WMRRCL ILLRLTLGVPHPMSCTHCPTILP 
S E MNLG P QLE MQKS P I F C VAHAG S PRLE 

T TPT 1Tr i 'HT flQTD'MT DQP 
i_ir JLj c vjxIJjvjo 1 ir In J_i Jr o ^ 


7301 


21202 


A 


7361 


234 


29 


LSTEPKGSKALKFNVTLLLNIAF* CNSG 
CNSNL* S * QQY*RVIQSHHLSQTLWLTP 
VI PALWE AEAGG 


7302 


21203 


A 


7362 


68 


339 


DRD I ANKPAWQLCQSR I LYLAQLQ F I DQ 
GELLPDFRIHYKVI * T C TAWH * RRNRL I 
DQWNRAD I PDLNP CQAWRLTP VI L VLWE 
ADVGGS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 

ZD 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanme, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K==Lysine, L==Leucine, IVI^IVIethionine, 
N=Asparagine, P=ProIine, 
Q~Glutamine, R—Arginine, S^&eriney 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7303 


21204 


A 


7363 


168 


2 


CFYSTHAHIFKELRQRPGAVAHAYNLST 


7304 


21205 


A 


7364 


399 


62 


QPGQYGKHPVLIKNSKIKPFWGDPPWP 
NAREG * A* KM VE PGKVRVQSAQ I KAL E F 

"NTT r l TDTTTT'l<n7'DTr'Tf TTT^TCTTDTaTT A"tf7T l r l T DDUTT) 

AFGKASKGGSPRSRILRPPGFLPYPAHF 






.f-i 




zoy 


fo 


v L.JJ V JjKJijOlXj^y IjoKJjzvv^ooX X. ^AriL.lN Vo 

IAGSSNCHASALRVPGSTGVWVESERGD 

■\7TVD TTTT 1 & VRWflWP a 
v XaEXJ? JCjjri. X vjT0x\.O X V V^/i 


7306 


21207 


A 


7366 


364 


182 


QWHDHOTFKLLGS SNPPTLAP * VAGTAG 
TCHCAWLI*FI ! TWKIGFKTSLKIIVLLG 
YFWF 


7307 


21208 


A 


7367 


155 


2 


ELGTKREG WE KS AT VAHACNPS SLGGRG 

P ffl T T 1 * P4 "P P FT P T* Q M A TSTMM Pf TT P P ZV 


7308 


21209 


A 


7368 


342 


184 


LSLPSSWDRRYTPPRLANFFKFFFKR* S 
LTMLPRLYRPFYI TTHQTLfTSAMTF 


/3uy 


21210 




7369 


2 


211 


i\Kiilr JXCLi* xDJiW J-JSIKlYKj x XiJAVjixHxjAx. 

KLNE I LTHATTWMNLENML I GRS * TQ KA 
TCCVIPFMDSIYV 


7310 


21211 


A 


7370 


190 


344 


YKNVT VKRILiT WP S T VAHACNP S TLGGQ 
'jcrKr ur x oyjix 1 Ji 1 oJjljiN.J.AK.FL.ij x 


7311 


21212 


A 


7371 


24 


254 


KPQGIGAHASSQHYGRPRREDHLSLGV* 
DQPGQHGEIPSLQKKKKKGLPGRGGVCL 
KPW FLK I LGE KNVWVLT P KG 


73 12 


21213 


A 


7372 


43 


358 


DbJLNXjJN i b>±jc r HGhiAUijG I lQVXiTHPS IT 
AMYFVHYCQS P*I LYGTINT * P PWHKN 
PIHIKTPSPCLQASTAINLQLSHINCNS 
KATPHPLGYQQTYPPLTVHST 


7313 


21214 


A 


7373 


2 


288 


IHI I LAFTSSLIiG I LVYRSHL I ASLLCL 
HGI ILSLTI IAPLI PLNTHSLLAPIGP I 
AIIiVlAA*QAAVGIiAl J I J VSISTTYGLNY 

vhnlallqccn 


7314 


21215 


A 


7374 


335 


1 


VUWJNJjE CjSIjUWPPPG vSPr r WijSIjPrNJNlj 
GFKGPPPRPGPFLKF* *KPGFSALTRMV 
S LSRPRDS APLAP P KAGVSRLNPRARAP 

TrCTJ'PPT/Tj'P'trtJTPTj'T T OPUWCT ^UVPPT 


7315 


21216 


A 


7375 


399 


91 


RIFFFPPSLKAPPHRQKPTPFPPTKKPL 
V Jf i^lJcirXv ±rf ]\v Jj J\JSJM o Jr W Ji V Jt* Lix 1 Jbo 
FFQKDAFFGPQV*KGNFFFFKGPPPLTQ 
NS PQGVP PKKKKKKKKKKN 


7316 


21217 


A 


7376 


377 


131 


YLY I KF YNF 1 1 1 LF Y I F I FN IF* FNM I F 

T\n\T T7VTTTVVTT TTSTTTTCTTT TTT TTTVT1?I?VT?T 
In JLiXjI? III? ii 1L ±Vi V 1M X.XjX XXjX ±.SS C c x X 1 J. 

LLiIILITIILILYIIIYYFIIKISrrrKT 


7317 


21218 


A 


7377 


2 


156 


FHHW* AGLELLGS SNPPTWAFQSAETT 
GVSHHAQLPHFILRKQELLSLGL 


11 1 o 


1 1 1 1 n 
2121^ 


•B. 


/J7o 


319 


59 


KQKSAPRNGNEVP * THQHSTATTLISQQ 
VF 


7319 


21220 


A 


7379 


308 


12 


FSPTISNFNYKSNPSPLIYLFQ* PITQL 

Xrmj-ullMi ~ J. 1 LN IJ C I_ JLllv _LJ llolo J_ O 1. 1 i 1 

PISNNVKIK*QFEHTKPTPFLPTLIALT 
TLLLPISPPILIIL 


7320 


21221 


A 


7380 


11 


169 


RRDLAMVPRLFSSSPGLK*SLHLGFPEH 
WDYSHEPPSWLLFFliFLITEGYFF 


7321 


21222 


A 


7381 


300 


2 


KFFP PPGFFFKGFFS PL* SHQKKTQI FF 
GEFFWGFKKFFLFFFPPGGFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLFFFF 
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SEQ II> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ 10 
NO: in 
USSN 
09/515,1 

ZD 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 

icq u ciicii 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

peptide 
sequence 


Amino acid sequence (A- Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine T L—Leucine, M—Metbioninej 
N = Asparaginej P—Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
-<a. — univiiuvvi], — oLop couon, / — possioie 
nucleotide deletion, V=possibIe 
nucleotide insertion 














-C r JL AN O DC JL^iN O S2 J_IJL 


7322 


21223 


A 


7382 


163 


160 


PGTSNDEIQRRNSRNRISPIAER*GLSL 
LPRLEWMGSIn"EAHSNLELQGSSNPPASA 

LELQGSSNPPASASKSTMITGVSHCA 


7323 


21224 


A 


7383 


1 


183 


VCVCVCVCVC VC I KWDS YSAI KKNELVA 

T?\R7T , TATTrPT\T7 r PT T T QP\7QAf?*'Kr r T , TVUMC!C 
JT V V 1 W rviXi-ixi 1 J. JLIjoJlIi V 71 In X X. XrilXtoo 

LICGS 


7324 


21225 


A 


7384 


2 


196 


KIQM WPG WAHACrPNTLGGQDGRTA^G 
LTYDGDKS 


/J2D 


ZiZZb 




/otto 


3 


\yZ 


RPPPCPANCFMLCEFFRTNGVFPCSPGV 
YFEPRN 


IdZv 


h 1 im 
ZlZZ J 


7\ 
r\ 




ZOO 


19o 


TMT Q f^iTTT .TP Q T± C1T -.TM TQ* DttnDD Z\ CJ /\ Q f"\ Q 
IrJLioar J_irCoi-xVj-u 1 1*1 J_o ir rilJir rHofioyo 

AGITGVSHRARPLVLFLRDSLYSGTQAG 
VPWRDLS SLQSRLAEQMRFCFP 


7327 


21228 


A 


7387 


164 


311 


SKQQKGLSGVAHPCNPS ILGGRGRWIN* 
GQEFETSMASMVKPCLSLLKL 


7328 


21229 


A 


7388 


3 


218 


EIMSFAATWMQLEANILSTLSHE*IAKY 

r~*lV/fT PT T Dr , 7\T7TITiTt7'T H/dVTT TPt IT"* m/~1 TATinTWT 

UYUjIb JjLSGAKIIW V LiMiN Iht Vbi GJJ1RD ih 
GCTYP S S TEQLG Y YS 


l3Zy 


o i Tin 
2123U 


A 


7389 


DZ1 


58 


G*FSHLSLPSSWDYKYVPPCPASFLYYF 
LVETGFCHVAQDGLELLDSNNPPSSASQ 
SAG I TGMSGCAGLRRF*KLRSWRYLVLL 
AMQVNEQGTSQAGEAVSEADAGQGCCCG 
T.r~ , T."P"P TT.Qnnv^T .r> 


7330 


21231 


A 


7390 


169 


322 


APGNFKTN*RKNGFWAGNTLGVSVKNMK 
NGWAWWLTSVI PALWE AE AGGS P 


7331 


21232 


A 


7391 


1 


353 


GT S P H YGS FHGHQ KD VTGKG SQSRPRER 
VLGSHARKNS LP VHE VKAYLL * KYLSI L 

GFFLLF FLLLLTLS FCSWHL IIS FLALF 
VIPLY 


I55Z 


2123.3 


7\ 
/\ 


hi no 
7392 


372 


3 


DMSWSLCLFLSGAISVLCNFRLSGSSDS 
PMS A * QAARTAVI RHHTRIi I FC I FSRDR 
ETGFHHVGQENLLMLDLQLFSGL.GFS I S 

A TT ,T,TT ,1VIT .T *\7 


7333 


21234 


A 


7393 


362 


176 


ARMECSGAI * AGLRGS SDS LAS AS * E AG 
YTGK CGHAQ * T F VFLVE VR FHHVGQNA I 

IN J.x\. JL O J_i 


7334 

i 


21235 


A 


7394 


2 


304 


ARAD C CARNE * G S VGMGP S E PG VG YNLL 
VFRFLRPLEKHS IRVGETRFSRCCLSQL 
CLARTGNSLTPCTSWVRQCIiALLRLMLG 
ALHPLSCTQRPTSHSE 




O 1 O'Z/C 
ZIZJO 






ZDO 


5 


P YNP S TLGG*GGQ I S * AHKFETS LGNMV 
KPHLSKNKQTNKQTNKQTNISWSWCTCL 


7336 






/ J70 




Z 


GPLNFTFHLQRKNCNLGHTRSPGGVYHV 
GRTETATGPIKRRNGLGAVAHTSNPSNL 
GGQGGWIT*GQEFKTSLSLPRWLTPVIP 
LV 


7337 


21238 


A 


7397 


2 


242 


RR YS D Y PDAYT T * N I L S SVG S F I S LTAV 
ILIIFMI *EA*KKKKKKKKKKKKKGGPF 
KKTLGGPKFNRRMEGKHFSLKGGR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K-L.ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7338 


21239 


A 


7398 


1 


289 


TKLLGLYYTLLQASNDFKSPFSIFDGIY 
GSTFFVTTGFHGLHVIIGSTIIiTI*FIR 
QL I FHFTSKHHFGLEADA* YWHLADARR 
HSESITHQCCSL 


7339 


21240 


A 


7399 


346 


40 


IIIFTNKIYIKINKRLKLFIIKILDLYI 
I*IFFFRK*FTFKQ*KF*IININKIYYN 
NIK* * IFIYLFFFFFFFFFFFFLFFFFF 
FFFFFFFFFFFFFFFFFFFLKK 


7340 


21241 


A 


7400 


1 


334 


NSCRRQRLLDLLLLPITL*LPQLIGYIE 
KSTRYE CGFHP I S PARVP FS I KFFLVAI 
TFLLFDLQIALLLPLP*ALQTTNLPLIV 
MS S LLLi III LALNLAYE * LP KGLD * AE 


7341 


21242 


A 


7401 


2 


302 


HTINKLGERYVLATTY* *STMTLLLTGL 
T VLHTGLYS VYLMTTTQWG* LTPHI INM 
KP * FTR* DTLVFIHLSP ILLLSLNPD 1 1 
TGWLVLLSYELYLLT 


7342 


21243 


A 


7402 


57 


338 


TTHNVIGYIEASTAYESGCDP IS PARVP 
FSIKFFLVAITFLLFDLEIALLLPLP*A 
LQTTNLPLIGMSSLLLI IILALSLiAYE * 
LQKGLD*AE 


7343 


21244 


A 


7403 


407 


68 


QEELVDPLTTVREQCEQLENCVKAR*RL 
* LCDERVS SRSHSEEDCSEDLFDFLHAR 
DHCVFLILYINLKYMCELT 


7344 


21245 


A 


7404 


494 


154 


SLFI FYLYL I FLF * * FFFHYFF YFLLS I 
FSFKLFFFFLLHFFFHMFFFLFFSPPNK 
IFFPKIPPRFFLFPLF**KFFFFLPFKF 
LSPLGFFF*SPPFFFFFFFFFFFFFFFF 
F 


7345 


21246 


A 


7405 


1 


400 


NPGRFLSTSNSSLYERTREIRPTSQSAF 
PRK* YHLKKKKKKKKKKKKKKKKKKKKK 
KKGGPLKKKPRGGQK*TGGEKKKFFPKR 
GGKKKP PGKF * KKT F FWGG KKMG KT PQ K 
KINPLGKKKI FKGKRGKTPPL 


7346 


21247 


A 


7406 


2 


257 


RASLCHPGWSAW* SWLIAALTSWLKQF 
SHLNSLKCWDYRHE PPHLA* LRFEHRHS 
GSSICTLNR*ILLSFPILSIRYKNGLIL 
Y 


7347 


21248 


A 


7407 


259 


47 


YVFFFFLFLLFFIRLQSVSPFFFFFFFF 
FFFFFFFFFFFFFFFVK*S FIEMFS F W 
L YLLRQVS I S IAYTLFG 


7348 


21249 


A 


7408 


3 


248 


VTFL*CDVEIAVLIPLP*ALPTTNLALI 
DMSSLLLIIILALRLAYE*LRGGLD*AE 
EHQATHSAYERAPPL 


7349 


21250 


A 


7409 


2 


350 


RS GM P RRY S DYPDAYTT * N I LGS VGAF I 
CLTAGILI I FMI * EAFASKRKVLIVEQP 
SINLE*LYGCPPPYHTFEEPVYIKSRHI 
RKESNP P KLVS S QPHGLHD FFKKKLGT I 
TEKN 


7350 


21251 


A 


7410 


2 1 


232 


LDQTYAKIHFT I VF IGVDLTLLPQHFLG 
LSGMPRRYSDYPDAYTT*NILSSVGSFI 
ALTAAI L 1 1 FM I * EAFAKKF 


7351 


21252 


A 


7411 


2 


321 


TSLLAVLLIIMTL*LPQLNRYIEKSTPY 
E CGFD P I C PGRVP FS I KFFLVAI T FLLF 
DLETALLLPLP*SLQTTNLPLIAMSSLL 
L 1 1 ILDLNLAYE * LQKGLD * AE 


7352 


21253 


A 


7412 


2 


194 


S I ECFLVP I TLLLCDLE I ALLLPLP *AL 
QTSNLPLIVMSSLLLIIILALNLAYE*L 
QKGLD*AD 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G^Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L— Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y—Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


7353 


21254 


A 


7413 


1 


197 


LYTTYFVTKTLLLTSLFL*GRTAYPRLR 
YDQL IHLL * KNFL PLTLALL I * HVS I P I 
TISSIPPQT 


7354 


21255 


A 


7414 


1 


185 


STPLP*ALQTTNLPLIVMSSLLLIIILA 
LS LAYE * LQKGLD * AEKKKKGAAVLKD P 
SGGPS 


7355 


21256 


A 


7415 


347 


228 


LE*LYGCPPPYHTFEEPVYIKSRLKRKE 
S ISTPP KL VS 3 QPHGLHD FFKH I KNYRKKK 
RKEGLASVKENDRGP 


7356 


21257 


A 


7416 


1 


296 


HAYDHAS AQLNGY I EQSTPYE CGFDP I S 
RARVP F C I KVFLVAI TFLLFDLE I ALLL 
PLP*ALQTANLPLIGMSSLLLIIILALS 
LAYE * LHKGLD * AE 


7357 


21258 


A 


7417 


3 


202 


TCSI* HVLVRMT * L I YDLE I ALL * P VP * 
ALQS TNLPLI SMTS LLL 1 1 I LALSMAYE 
*LQTGLD*AE 


7358 


21259 


A 


7418 


1 


321 


SNTLLALVLITITS *IiPQLSGCVGECTP 
YECGICRILNARALCCMQIFLKAITFLL 
FDLEIALLLPLP*ALQTTNLPLIDMSSL 
LL 1 1 1 LALS LAYE * LQKGLD * AE 


7359 


21260 


A 


7419 


1 


212 


RTRGIIFRPLSQSLY*LLAANLVILT* I 
GGQPVSYPFTIIGQVAYVLYFTTILILI 
PTICLIENKILKWA 


7360 


21261 


A 


7420 


2 


257 


GRVGGRVGYS I AFFLQAIT VLVLDLES A 
LLLALP* S LQTTNLS LI VMS S LLL 1 1 1 L 
ALSLAYE * LPLGLD * AE YWGEMMTTTLD 
A 


7361 


21262 


A 


7421 


367 


269 


HCDLKLLGS S DP PAS AS * VAW I TGTCHH 
TKLS 


7362 


21263 


A 


7422 


2 


116 


ALQTTNLPLIVMSSLLLIIILALRLAYE 
*LQKGLD*AE 


7363 


21264 


A 


7423 


2 


336 


KPECGREG ILPCCS S SAWPEGSFRPFQM 
NLGSFLSFFFLFFFFFEKGLFLCPPV*S 
PYGKSNLLEPSASRFKQFFCLGPLNNWD 
YRLFPPFPGNFGVFKKNRVFPFWPSLV 


7364 


21265 


A 


7424 


3 


154 


LFL*IRTAYPRFRYDQLIHLL*ENFLPL 
TVALLI*HVSIPITISSIPPQT 


7365 


21266 


A 


7425 


3 


154 


LFL*IRTAYPRFRYDQLIHLL*ENFLPL 
TLALLI*HVSIFITISSIPFQT 


7366 


21267 


A 


7426 


1 


314 


INTLLTLLLIIITL*LPQLNGYIN*STP 
YE CGFDP I S PARVP FS IKFFL IAITFLL 
FDLE I ALLLPLP * ALQATNLPL IDMPSL 
LLIII LALS LAYE * LHQGLY 


7367 


21268 


A 


7427 


1 


331 


I L I INTLLALLL 1 1 1 T F * LPQLNGY I EK 
STPYECGFDPISPARVPFSIKFFLVAIT 
FLLFDLE I ALLLPLP * ALQTTNLPL I VM 
TSLLLIIILALRLDYE*LQPGLD*AD 


7368 


21269 


A 


7428 


2 


373 


SDRNHI IHALVI T I LLGLYFTLLQASEY 
FESAFTISDGIYGSTFFVATGFHGLHW 
IGS TFLT ICFIRQLI FHFTS KHHFGFEA 
AA*YWHIVDVAGLFLHVSIYRGGSYNHQ 
IIPSVAQESTPA 


7369 


21270 


A 


7429 


378 


63 


PSFFPPFPP*NFFFPPRPLFSPGGFPPF 
FPPPQKGPPPKIPPGFFFSPPFWEKFFF 
PPPPLILPPPRFFFNPPPPFFFFFFFFF 
FFFFFFFFFFSFFFWDFLGSGC 


7370 


21271 


A 


7430 


1 


109 


QTTNLPLIVMSSLLLIIILALSLAYE*L 
QKGLD*AE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C— Cysteine, D~ Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaianme, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=ProHne, 
Q=GIutamine, R==Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possi ble 
nucleotide insertion 


7371 


21272 


A 


7431 


358 


290 


FLLVIPJSTFLFHFLr YIIIS I VPNVSFFH 
IIYILSIFFPFIIFILLLSSFNFILFFF 
FITSFPFFYFFFFFFFFFFFFFFFFFFF 
FFFFSFFFFFFF* FFFTN 


7372 


21273 


A 


7432 


1 


95 


PTRPPTRPHNSLQL*TPGLKQSYPRLPS 
S WD * RHM P SH P ANFRKF F * RWHL YC P GW 
S*TPGLKQSYPRLPSSWD 


7373 


21274 


A 


7433 


1202 


870 


HNI FKIRPHCRLTHRGKPFYWGLLVTPV 
FSPSLRPSFLPSFLPSFLPS*LPSFLPS 
FLLFLSYSFCPLSFLCLRCIPHSAFFWG 
NPRQVQENCVI VT I YRYLS FTALHQL 


7374 


21275 


A 


7434 


163 


1 


S VYVKRIGLGRVAYTCNP S TVGGQDG * I 
T*GQEFETIIiAKRREPLFLACFLFLI 


7375 


21276 


A 


7435 


148 


362 


NDDNSVEFGIRLHKGLGTVAHTYKPSTL 
GGQGGHIA*AQEFMTSLGHMT*SSLYHK 
LTEYGWRDTASLLMC 


7376 


21277 


A 


7436 


61 


233 


GGDLRHWPSSDSFFLSLSFFF*NKNGGG 
LALLPRLDLNSWSQAILPLKPPNALGLQ 
A 


7377 


21278 


A 


7437 


44 


242 


FVCLFLFFFFEKESTFQHDLSSLQPPLP 
RFKRFSCLSLPSTWDYRLAPQCIiANF* I 
F**KSGFTFG 


7378 


21279 


A 


7438 


139 


300 


YCVSLLIFFIFIIYIFILLIFKFLFITL 
NYIS FLFFYF*H* FYF * KNF I FVFPFGY 
FPRP 


7379 


21280 


A 


7439 


1 


221 


LKLQHHDPTTISLVITRNWKQPKCPNVQ 
Q I VH* SAI KRNELL IHAKTWMNLRGI ML 
SEICQPQKILYLEIPFI 


7380 


21281 


A 


7440 


35 


345 


FSFCCVCVCVCVSLPKHAI IEMLNQTHP 
YVPKGVAYLPEREPFIVPMDPELTA*YE 
DYRTHESAQEPSEPHQLSRRSLPH*PGH 
ATRDHS PPTDSWGKRRTSH 


7381 


21282 


A 


7441 


3 


384 


VIHL*AGAVIIGSRSKSTNALAHFLRQG 
TPTPVILVLGI IETMNLLIQPVAI^AERL 
TDNITAGHLLMHL I VSATLAILT INLTS 
TLIMFTVLILLTILQIAVALMHAYVFAL 
LVCLYLHDDAYDHGG 


7382 


21283 


A 


7442 


15 


277 


GLAILPRLVPNSWPKVILQWPPKMLG* 
QA 


7383 


21284 


A 


7443 


21 


694 


NGGLNAHLAS AS E F DHS GVQ LIEREEEI 
CIFYEKINIQEKMKLNGEIEIHLLEEKI 
QFLKMKI AEKQRQI CVTQKLLPAKRSLD 
AD LAVLQ IQFSQCTDRI KDLE KQ F VKP D 
GENRARFLPGKDLTEKEMIQKLDKLELQ 
LAKKE E KL LEKD F I YEQ VSRL TDRL CS K 
TSGCKQDTLLFTKPCMNGYSRRIKWATEK 
MMALCA* LSMKQAPT I ELQKE VQGERRP 


7384 


21285 


A 


7444 


141 


428 


DRSLLEAGDGRIAGDFFFF*TGTNFVAQ 
AKVQWGI HS S LHP * P PGLKQS S YLNBLG 
S YHHWLRPHALFFFL KNRGS L FC PG WFG 
TGFRGTTFWGAK 


7385 


21286 


A 


7445 


35 


343 


GYSIIGCPKFLDEKIFDSGHEKERKINS 
LLF* KKKKKKKKKKKKKKKKKRGGLLKK 
TSGGP KI KRGGGI KNFP I KGGVKKTSGE 
FFGKKPFFGGEKFWE I LP 


7386 


21287 


A 


7446 


403 


193 


EYIKQGAE*NNQPKLHAHRKRLTWKETP 
RCIHEGIAPAIMNSDCLVFDTSIAQLFA 
ENGNLGINVTISMC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HKHistidine, I~Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine 7 S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7387 


21288 


A 


7447 


411 


110 


LTDSLINIVYR*KMSYIS*TNSA*VYLY 
IFCIYIYIYIYIYRRIYSIYRC*YIYYG 
PAH I VLVYS S S Y I LVYYE SHKYGYIMGL 
NILSYIFIYSYFSACS 


7388 


21289 


A 


7448 


1 


380 


PTRPQKLKKNCNGVSNGDLCFFGGGQFL 
IWGFFFFFCILIEGGLNSKNYEDKSLIK 
T YGAKKAE I EHTEN* KNRETLLERAAQL 
A I KVTNPNAMLRT VEHD YAM YMHNF WLN 
KLNNCHKVAISCKQ 


7389 


21290 


A 


7449 


200 


437 


VQSNNNKNFKRRRL*PGVVAHTCNPSTL 
GGQARRIS*AQELETGLNNLMRPCNYNK 
F INVLDDVHGVTS PVSQ YYKKVC 


7390 


21291 


A 


7450 


405 


3 


PPRFFGFFFPFYPLKFFFFPKGFNFFGG 
FFP I FFPPKKKFFFKNS PGGFFLPPFLG 
KIFFFLPPFKFGPPRGFF*RAPLFFFFF 
FFFFFFFFFFFFFFFFFFFFFLQRINFT 
ARPGTPLSRAVPLRGALPPSA 


7391 


21292 


A 


7451 


2 


348 


PRVRPRVRYKLQTKRCDLTLPS *GHSHV 
SLRNTTFDALKS * DTDSLLS PKLECHGV 
NTPNCNLRLPSLSDYPASLSREAWIAST 
RLRAWE I FVFLAQMRQD YCMNP GGGG CN 
ELK 


7392 


21293 


A 


7452 


1 


168 


LVHDGHAGLKLLTSGDRPA* AYQSAGI T 
GVRPLGPPTYYYFRTASFMTFI CEFATV 


7393 


21294 


A 


7453 


184 


260 


SSRGGSLASRSSR* STLLGLPKCWDYRC 
MPPSLANFFFFNF * I *NFCFW* RRSLGM 
LPRLWNS * MQA I LPLR I RLQG I HFAGT 
QTFSL 


7394 


21295 


A 


7454 


405 


64 


FFFFFNFF*KKKI*DVFAKIFFKIFFFF 
SGLKI FWGGGFQNAPPQKKFFF * KIKS F 
FFLNFFF*KNIFFFVAGWFFLLNFFFR 
GAPLFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFF 


7395 


21296 


A 


7455 


368 


2 


SQS FGRPRWENRLGPEQH I KI SRYRPAQ 
VHACHPSTLGGRGRWIT*GQEFKGSTHL 
GLPKYWDYMPEPLLLVYIQCFHFRTFCN 
KAAIHCFKPEQLLYNKTSYFKESYKVKA 
ALVACKPTRP 


7396 


21297 


A 


7456 


2 


183 


TERDS VS KNKY I Y I HTHTHTHT Y I CVHG 
VIYVYI*SLLLIYNIAYILIYHVFYILL 
IFKA 


7397 


21298 


A 


7457 


281 


127 


YNFC I FM * RCGFNNVGQSGLKVLLRGYS 
PAS AS Q I AG I TG VS Q CAQ P VA YS 


7398 


21299 


A 


7458 


2 


194 


TSVRKYF*YPFTITDRIYGSTWWATGL 
HGLHVIIGSTFLTICLIRQLIFHFTSYH 
HFGSEAAG* YWHFVEPARLFLYVCIY* * 
GSTWWATGLHGLHVIIGSTFLTICLIR 
QL I FHFTS YHHFGS EAAG 


7399 


21300 


A 


7459 


22 


319 


GGS VTGAYHR I LDHDRNQ I FLA* GGSVT 
GA YHR ILDHDRNQ I FLALL I SMLLGL YF 
TLLQASKYFECPFTICDGIYGSTFFVAT 
GFHGLHVI IGSTFLTICWIRQLILHFTS 
RHH FGLE AAA* YWH F VD W * L FL YVS I Y 
* * GS VTGAYHR I LDHDRNQ I FLALLI SM 
LLGLYFTLLQAS KYFE CP FT I CDG I YGS 
TFFVATGFHGLHVI IGSTFLTI CWIRQL 
ILHFTSRHHFGLEAAA 


7400 


21301 


A 


7460 


1 


222 


TILLGLYYTLLQA* EYFKATFTI YDGI Y 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K— Lysine, JL^Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














GSTFFAATGFHGLHDIIGSAFLTICFIR 
QLIFHFTSEHHFGFEAAA*YWHFVDAT* 
LCMYVS I Y* * GS T FFAATGFHGLHD 1 1 G 
SAFLT I CF IRQL I FHFTS EHHFGFEAAA 


7401 


21302 


A 


7461 


3 


246 


LN Y S L F FHGE ANLGTTQ VLTH P S TTAM Y 
FVHYCQPP * I L YGT INT * P PWHKNP I K 
KKKKKKKKKKKKKKKKKKKRGGGF 


7402 


21303 


A 


7462 


2 


155 


SRSRAALLLPLP*ALQTTNLPLIVMSSL 
LLI I ILALSLAYE* LQKGLD*AE 


7403 


21304 


A 


7463 


1 


136 


PTRPAPSHLLYCKNIKKLFPSGTANEAL 
I IAD I FLRCKKKKFKVK* * VLKASEE * N 
GCKHLHGMH * NS AVCTVF WLKQ I MNSD * 

KKKKKKKKKKKKKKGGENNK* KNIKKLF 
PSGTANEALI IAD I FLRCKKKKFKVK 


7404 


21305 


A 


7464 


166 


192 


N IN * MW YMHTVE C YS DF KRKE I L IHATT 
WMNLEDIMVSEISQSQKDKYCMILHIEY 
KLYIHM 


7405 


21306 


A 


7465 


416 


55 


LFFLRLGYFFLPNIRVFFFLSTLFFFFF 
VWGFFFFGRFFFFFPPPQEVCFFFIF*G 
I FFFFLFK* KNMFFFFFFLWGPPPFFFF 
SGAPLFFFFFFFFFFFFFFFFFFFFFLR 
KANTKKMF 


7406 


21307 


A 


7466 


295 


84 


IWGNVTLLQLVSCLFCFLFLH*GFFCCC 
CCLFLFLFVCLFVCWFLNLKPS IKCLLP 
FSLSLGCAEI PSS F 


7407 


21308 


A 


7467 


142 


258 


PGTVAHTCNPSTAGGQGGRIP*GHEFKT 
NLANMVKLHLY 


7408 


21309 


A 


7468 


3 


192 


PSEYFESPFTISDGIYGSTFFVATGFHG 
LHVIIGSTFLTICFIRQLIFHFTSKHHF 
GFEAAA* YWHFVDW*LFLYVS I Y* *GS 
TFFVATGFHGLHVI IGSTFLTICFIRQL 
r FHFTS KHHFGFEAAA 


7409 


21310 


A 


7469 


2 


110 


GRVGKHHFGFEAAA*YWHFVDW*LFLY 
VSIY**GS 


7410 


21311 


A 


7470 


380 


8 


TPQKRKKLKKAGEGVFPPPPKKTTPPPP 
PIFIYFIFFFFFIYFFFFFFFFFQYMSF 
FYSHIMIQRHFLNF*ESFYSCISILQ*L 
FKFFSFLGGLIKL 


7411 


21312 


A 


7471 


1 


107 


PTRPHHFGFEAAA* YWHFVDW* LFLYV 
SIY**GS 


7412 


21313 


A 


7472 


2 


360 


VNTLIALLLIMITF*LPQLNGYIEKSTP 
YECGLDP I S PARVP FS I KLFLVAI TFLL 
LDLE I ALLLPLP * ALQTTNLPLRAMSSL 
LWI ILALGLAYE * LQRGLD*AEWLHSL 
TQPTNNN 


7413 


21314 


A 


7473 


423 


167 


VKVHKCFLPGVAPHAFNPS TLGGRGGR I 
A* AQEFKTSLGNI VRPPSDTCNP IVLGL 
QA 


7414 


21315 


A 


7474 


1 


73 


I KPE * YFLFAYT I LRS VPNKLGGV 


7415 


21316 


A 


7475 


131 


3 


I S W VGAE WLTRV I PAL * E AE S GVS RGQ E 
I E T I LANT VKPDAW 


7416 


21317 


A 


7476 


2 


103 


PRVRSTGFSSIAHITRDVNYG*IIRYLH 
ANGAK 


7417 


21318 


A 


7477 


2 


325 


NTLLALLLI I ITF* LPQLIGYIEKSTPY 
E CGFDP I S PARVPFS I KFFLVAI TFLLF 
DLEIALLLPLP*ALQTTNLPLIVMSSLL 
L 1 1 ILALSIiAYE * LQKGLTERDT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=G I uta mine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TJn known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7418 


21319 


A 


7478 


423 


170 


FFSFFFFFFFFFFFFFEIDKNVYLIM*C 
TNPQEKRNLLFCFLVMVS PVCITPHTYI 
CVCVCVIYIYIYIYIYIYIYPFIFSCCD 


7419 


21320 


A 


7479 


433 


50 


RVFFFFFFTPEKKTFFPQE I YLFFSPFP 
L*NFFFSPKPLFFLGGFAPNFPPPKKVF 
FLKFPPGFFFSPPFKKKFFFFFPRFFFA 
PPRFFF *GPPS FFFFFFFFFFFFFFFFL 
TVKFTRGFRGFCGQI 


7420 


21321 


A 


7480 


401 


337 


FFFFFGKFFFPQEFTVFFPPFPFKIFFF 
SLRLLFFWGGFAQFFPPQK*VFFSKFPR 
WFFFPPIiFRKKFFFFFPG * FLAPQGFFL 
KGPPPFFFFFFFFFFFFFFFRPI*VGGR 
VGNNPNPNTNYLPftT 


7421 


21322 


A 


7481 


1 


84 


KOTLPLTIALLI*HVSIPITISSIPPQT 


7422 


21323 


A 


7482 


562 


412 


LFSFHVSGTIVIAHTCNPSTLTG*GRIT*G 
LEFKTSLGNTVRSHRYRKKKIA 


7423 


21324 


A 


7483 


12 


233 


DIFHFTSKHYFGFEAAA*YWHFVDWRL 
FLYASIY* *GSLL 


7424 


21325 


A 


7484 


385 


56 


GNFPTPKKFNFFYPFYLSIFFFFPKKLK 
FF*EIFPIFPPPKFLFLPPNFPVLFFFF 
FLFWEFFFSFLFWFFFFPFFFFFFFFFF 
FFFFFFFF*KTTEFYFVNSNFQHTK 


7425 


21326 


A 


7485 


424 


6 


FFFFFSEGSGLIREDPIGAGALYDYGR* 
LVWTG * TLFVGVY I VIE I ARGNRLCD 


7426 


21327 


A 


7486 


16 


129 


FXXXRXXL,IXKEPSINLE*LYGCPPPYH 
TFEEPVYIKS 


7427 


21328 


A 


7487 


183 


387 


I ASGLD I FLLLLLETGSHS VTQAGVQWH 
NHDSLQPRLTGLK*SSRRSPRGSWDYRV 
LLCHPGWSAEA 


7428 


21329 


A 


7488 


351 


169 


EKKIFFFSPRGYGPPPFFFLKAPPHIFF 
FFFFFFFFFFFFFFRGGGGDHKNFFFFF 
WGFLKNFPLFFFFPRV* 1*1 FFFFFFFF 
FFFFFFFFIiGGGGGTIKIFFFFFGGF 


7429 


21330 


A 


7489 


47 


212 


THASAL FL * I RTAY PRFRYDQL I HLL * K 
NFLPLTIoALLiI *HVS I PIT I SS IPPQT 


7430 


21331 


A 


7490 


98 


273 


TSGSCCSAHILAFKP*AGMATFITKSL* 
* CHTHTHTHTHTHTHTHYMYI YTHMHTY 
IF 


7431 


21332 


A 


7491 


70 


405 


L YGCP P P YHTFEE P VY I * EAFAS KRKVL 
I VEEPS INIiE*LYGCPPPYHTFEEPVYI 
KSRQKRKESNPPKLVSSQPHGLHDFFKK 
KKKKKKKKKKKGGG P L KKPRGG PMLTGG 
GKKNFFFFRGGEKKPPGGFWKKTLFLGG 
GKLGTPPPRRF 


7432 


21333 


A 


7492 


22 


208 


RSSLPARPTRAVI L 1 1 FMI * EAFAS KRL 
VIiIVEEPSINLE*LYGCPPPYHTFEEPV 
YIKSYV 


7433 


21334 


A 


7493 


2 


204 


KLPETVKKKKKKKFYSFRSIVMLLLYLF 
S * SLS FFLLYSLLFFLFLLS FYI YS FLC 
FPLCKTATTEH 


7434 


21335 


A 


7494 


2 


104 


TTLLLSRT*NKLT*LTPLIPSTLLSLGG 
LPPLTG 


7435 


21336 


A 


7495 


291 


160 


PRHF* I FVEMGSHYVAQAGLKLVAS SNL 
LAWASQS AGVTGVSHGS LLITSCL 


7436 


21337 


A 


7496 


422 


180 


LGAPQGVF*KGPPFSPSSSSSYSSSSSS 
PS * QNI FNKI FLFVCFVLETES VTQVGV 
QWRDLGSLQPLPPGFKRFFTITTH 


7437 


21338 


A 


7497 


244 


296 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
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peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
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26 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C—Cysteine, D— Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T~Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFFFFFFFG 
GNF*FYK*LLGNRDGVCLPPSSYL 


7438 


21339 


A 


7498 


1 


410 


P TRPDQL I HLL * KNFLPIjTLiALL I * HVS 
IPITISSIPPYT 


7439 


21340 


A 


7499 


51 


478 


PQKYTLIMKICIIPPGKGFRPVEMLAGG 
NENTRCLIEERSYKYHYSLIKLGLQQQC 
IFSSLLYKYNMIYCLMLCYYIGYIFIFY 
DMYYRQYSVLYYKLYNYVMLI* IYILYL 
FMC I * KY I YF I FM* YKY I F I YL YL Y * YK 
YIFYIYLCVYKNIYTLYLCNINIYLYTY 
IYININIY 


7440 


21341 


A 


7500 


3 


110 


FTS KHHFGFEAAA* YWHFVDE V* LFLYV 
SIY**GS 


7441 


21342 


A 


7501 


168 


327 


AFCYKATVIKTI * Y* YKHRYTEQWSRTK 
NPDINPYICGQMI IFLQLKSLHKI 


7442 


21343 


A 


7502 


400 


159 


NNFSPLRNLQGFFGPLKFPMGFQAGPPG 
* GFS F FLRSNFFLDVFRGKY *TFGF*DF 
FYCPSPNEVLGEPPLMGGNKPGFP 


7443 


21344 


A 


7503 


401 


257 


TS Q KKKKKKKKT P KKKKKNRE KP P KKKR 
GTRGE P FKTGGGKE KKKF FKKKKKKGKG 
PP* PQLVKRGGREKKKKNLWKKKGAF* K 
KKKKKKKNP KKKKKKPGKT P KKKKGD P G 
GTF 


7444 


21345 


A 


7504 


44 


344 


LLASLANLALPPTINLLGELSVLiVTTFS 
* SNI TLiLLTGLNI LVTALYSLYI FTTTQ 
WGSLTHHINNIKPSFTRENTLMFIHLSP 
ILLLSLNPDIITGFSS 


7445 


21346 


A 


7505 


286 


252 


I * FAFHH VGQAGLE FLTSDLP ASAS QS A 
RITSVSQHAWPVHNIFYSLL 


7446 


21347 


A 


7506 


3 


213 


REL PGLKRFFLLS LLSSWD * WRMPSHPA 
TFCIFCRDEVLPCCPAGLLILLVWNLKN 
CCCYHHHRYYYCY 


7447 


21348 


A 


7507 


400 


20 


SLPQKKVSLLKLKSAVQLVMF*DRFSLC 
HPGWRA* GQSGVTTAS SLRAR * S S CLT W 
EYRPS PLPLADFLFW* RRGLP I * PRWS 
NS *GQVGIiLHQPSKMLKFSSLSPCARPL 
FFFFNRSTNLRVQS 


7448 


21349 


A 


7508 


349 


227 


SDFLTSANPPASAS KSFGI TGVSHHTRP 
*PRFLKVNLIYSEMEVYN*ASFRHVD*G 
GLELLTL* FTHLGLPKLWDYRREPPHPA 
LTKIFKGKPYIQ 


7449 


21350 


A 


7509 


371 


21 


FIiVETGFHHVG*AGIiEPLASSNLPASAS 
QTAGI TGMSHCAQPMPGSLLNKTMLAVA 
TEQWVGYMWPSVFGHITARPEGKRLSDC 
LHSDTUVCWCVCSGLIFVCFETGCRFA 
LQPG 


7450 


21351 


A 


7510 


330 


14 


D KVC S VT QARVQ WHDHGSLL F L S S GLRQ 
S SHLS LLS S WDHRHMPPGMANFF YF* * R 
LRSHYVAQAGLKLIiAS SNPPTLTCQSAG 
I TGVS HNT W PKAL FS VTDL KH 


7451 


21352 


A 


7511 


417 


33 


APPCGGGPPKRIFGPSRKRGKGPPPPKR 
FF*KNKPPKGGGKKSPPPPKNFFPKKKN 
LGPKKKKSGFWGRFAPPRAPGGGVFNFP 
FKAGGEKFFSRGEINPPRGAGKKKGFFL 
GKKNTKKKTPKKKDL 


7452 


21353 


A 


7512 


81 


296 


CLRE GAS HS AAQAGLHWHDH S S LHP *NP 
GLiRGS S RLNLLS I WDYRCVP PS LAJSTFE S 
CLLiKI I LGS PFPKNL 
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nucleotide 
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residue of 

peptide 
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Predict- 
ed end 
nucle- 
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location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptbphan, Y=Tyrosine, 
X=Unknown, *— Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7453 


21354 


A 


7513 


412 


279 


KKYTQGLGVVAHAYNP STLGSQGRR IP* 
AQFETSLGNTGRPCLH 


7454 


21355 


A 


7514 


74 


126 


KI * DWVPKGAPLQGPVF*NSTFRPRVRN 
RGPPWPRPGPTVWKG*NKKN* PAPRGSN 
PPFFFPGNPGPFKLRGWPGSS*GSFQNC 
KKAQ AG * AP T * KP GP P PP QNKV 


7455 


21356 


A 


7515 


332 


20 


GERQVGSYFFMNTERLFGMTKKF*KWIV 
SMVAQHSSSISGVWVFRGQEIAFPLSPD 
WQVDYESYTWRKLDPGSEETQTLVREYF 
S WEGAFQHVGKAFNQGKI FK 


7456 


21357 


A 


7516 


1 


167 


RTRGQRFTVLARI VhlS *HCDIiPASASQ 
SAG I TGVSHCTRPGLS FLYTP TAKHST 


7457 


21358 


A 


7517 


2 


223 


GRMEEMGSHYVA*AGLEFLGASDLPA*A 
PQRAKILLLLGGHKPGLTILPRLSSNSW 
PHVKWPRWPLKALGLRG 


7458 


21359 


A 


7518 


151 


476 


SLGKFAVS FS FS FLK* RRALAVLPRLCS 
NSWP*AVLLPWLPRVGIAGMSYHTQPDY 
HFQARAVAEILGIiVSTQHSSVTGLPVEN 
SFRAGHGGPRLQSQHFGRLRWADC 


7459 


21360 


A 


7519 


3 


134 


CS FRLGAVAHI CNVITLGGRGGRI A* AQ 
EFETCLDNIARPCLY 


7460 


21361 


A 


7520 


14 


219 


APS IHGFGFLYTYIHTHTHTHVYMYI * V 
YIHTH I ES I KCQPIiTQLQSNKKANRI VF 
RTQFMLTCYFMR 


7461 


21362 


A 


7521 


134 


334 


RKHSC*IFFNFCFYLiSLETRSHYVASTG 
LDLLAS SNP PALAS QRAR I TGMSHHAP P 
VLTRFKKSLF 


7462 


21363 


A 


7522 


406 


145 


FLKTGFSPFGPFVF*TPPPGFPPPPPPP 
IFGVPGGGPFAPPLFFFFPQTKHFPPFL 
NPPSPPPQRGFFRGFFLFPPAFFLKTPP 
FPF 


7463 


21364 


A 


7523 


3 


382 


HPQGEGGRL TS LE * NTS LRLKKKKKKKK 
KRGGPFKKNPWGAQI*PGKKKKNFFLKR 
GAKKNHLGNFGKKPYFWGGKKLAKPPKK 
N*TFKGKKKFLRGKGGKKTPKPWLLKI F 
FSGFYLKKIFPPGP 


7464 


21365 


A 


7524 


319 


106 


GTRS CSVT*AGVQWHDHS SLQP * TPWLK 
RSSWLRNRWDYKHEPPCLNNFYFFCRDG 
GLLCHPGWSQSPELK 


7465 


21366 


A 


7525 


433 


228 


MQ P YNWS VNS S S QL S LS S ARQRVTH VLT 
GNFFI K* ERWDLP VLPRLVSKS WPQVI F 
PPWPSKVLKLQV 


7466 


21367 


A 


7526 


26 


351 


CRSLVFSLPKPGLRFVSILHIQVFSHFD 
F F FLTGNTQ K* RK * TPF I FPA* LFKTPA 
PKPGMVAHTCSPSYSGGLDGRIA*AQEF 
RT S LGNKTN P S LEGGRKKKKP S GS 


7467 


21368 


A 


7527 


438 


62 


QLSKIHRPPAGGRFFKSGGGFKPFCPVG 
IiPPFGKKANQPTPKNPPKKQKPSGGGIiS 
PMGKPP*FPCSMPSDSPGSVSFSRGGVA 
PAPKDYCPIV*AALIPSKKKKKSKPG*V 
KKGKPITL YS I YNK 


7468 


21369 


A 


7528 


3 


132 


FYYL*RRGFPVLPRIiVLNSWLQVIIjLPW 
PPKVLN*AAHHPHG 


7469 


21370 


A 


7529 


2 


628 


FF F L P G WNRVLLMGRGLGE ANS REGRG 
P SQ P RGMHGQLE VRGGRS QGD GWHGTLS 
P PWGR I YRGAP PTFAAPQAPKPFRQLLP 
M*LPGRPSSCLLPETPALSPLPSAEWGG 
FKLLTGTSETPGPSGSPSVGSANAQAAG 
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26 
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nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ATTIVARGHTPCTRGPLDTHAHSSLRWH 
AHTRPPPRPLAHEHGTHTHAHPHTPGAW 
VGEGPFQGWGRA 


7470 


21371 


A 


7530 


63 


205 


GRPTRPRTRGVKIL*QFEHTKPTPGLPT 
LIALTTLLLPISPFILI IL 


7471 


21372 


A 


7531 


390 


56 


KTFFFFNLIAFTEAKSVLLFLLLFFETG 
SRSSAQPGVQWHHQSSLQPQPL*LKGSS 
CLSLLSN*DYRCAPPCQ*SPCLSLLKCW 
D YKRE P PHP AKE S HMQQ S YL I YS LY I A 


7472 


21373 


A 


7532 


543 


335 


LAAYGGTCLQS*LFSRLRWEDHLSPGVQ 
GCNELQLQHCSPAWATE*DPVFKGEKKE 
RVLVKVFS 


7473 


21374 


A 


7533 


3 


145 


WCDHGSLEPRSPGLKQSSCLSLPSNWDH 
SSEPLHTTSMWNF* IFKFLNLK 


7474 


21375 


A 


7534 


386 


106 


swvtdeitmei *ecvlndnsdtpyqnlw 
dtekwlrakftalnacmkkseraqirh 
tcvhthththththtgskcllrkyll.lt 

IQQQNIRIN 


7475 


21376 


A 


7535 


398 


2 


SLQPLPPRVKCFSSPSPPRRWDYRGAPP 
RGGKFFFFGKKEFPPVGQGGF* FLPPGV 
LAPPPPQKGRISGGSPRPRPFFFFFKIR 
I S YP I LAKQLKLCFLKKKKKS I QAIKLY 
HHQKNPITKSSVFWESQPRA 


7476 


21377 


A 


7536 


2 


344 


HS KCVPAEWTMAYYIRYL SISS LLH I L 
ENIVHFFFHFLY*GPNNLFFFYFAGTRL 
I QRS WK* PM 1 1 * * VHVQ I KREGQ I FNK* 
IFCMIGRGG* ITCGQEFETSLVNMVKPC 
LY 


7477 


21378 


A 


7537 


116 


224 


LHTHTHTHTHTHTHTHTQ FNKL P * LFI P 
CGHIPRRY 


7478 


21379 


A 


7538 


1 


383 


WHERTHSRI I ILFQGLETLLPLIAF* * L 
LAS LANL VL P P T I NLLGE L F VLVTTF S * 
SNI TLLLTGLN I L VTALYS LY I FTTTQ * 
GSLTHHINNIKPLFTRENTLMFIHLSP I 
LLLSLNPD I ITGFS S 


7479 


21380 


A 


7539 


31 


332 


DNYLSSYSSSGSWMRELILASQGIRWEP 
I VDRT P S HHRTHTHTHTHTHTHTHTHT V 
LFRLGPWTHAREPRGHICGN* IRNQSP* 
KNPRTRGKNLRTPHGW 


7480 


21381 


A 


7540 


199 


15 


NNVQIK*QFEHTKPTPFLPTLIALTTLL 
LPISPFILIIL 


7481 


21382 


A 


7541 


337 


157 


AEIVPLNSSLGNRVRLH*MESYSVAQAG 
VQWHDLGSVQPLPPGFKRFSCFSLLSSW 
DSRLYFRCHD 


7482 


21383 


A 


7542 


28 


313 


RARI GS DQ CLS AGTQ WENKS Q I SGDFM 
WNDFRS * S VTRVGVRWRNHGSL* P WIPG 
LR*SSGFGLPNCWDYRDRNEVWGGKKEV 
TCPTVNLQKSQ 


7483 


21384 


A 


7543 


493 


180 


GGFRGSWLVPRDGGQGVGFCSLPPPPSR 
VKQFFCLRFPRRWGGREGSPRPSYFFFP 

-fci/"rjt3 'ctpttt r i 'D\T\n "nttpt c* r^r* t tr* Trn t»t t» o /o 
*xl»is.i^Kif r J? LAjKv VJbJNr JjGUGVGPPWFoQ 

RVGFPGLTPGPRASFFFNLI 


7484 


21385 


A 


7544 


403 


426 


FF*KFFITFHTLFFFFFFFFFFFFFFFF 
FFFFFFFFFLAPPPGGFFFS FFYQKI FL 
KFPGQPEKFALPFGGGFFF*GKFPQFFF 
CFLVFFFFFFLFLENGVSLYCPGRSRTP 
GLKLSSRLLASLRAGITGVSH 


7485 


21386 


A 


7545 


3 


335 


RHYSDYPDGYST*NILSSEDSLISLTAA 
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NO: of 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-Cysteine, I>=Aspartic Acid, 
E=Glutamic Acid, F— Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L— Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, -=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LliMVFM I * E AFAYRRTGL I A* E P SMNLE 
*LYGCPPPYHTFEEPGYIKSRLKRKESN 
P PKL VS SQ PHGLHD FFKKKKNRGGP F 


7486 


21387 


A 


7546 


424 


140 


RRVKFVGGGAPQGPPPKRGVLPKIPREK 
VFP P P PG * NGP PGPAFKTP PKKKKN I S S 
PPPGNWAPPGGLLKGPPPFFFFFFFFFF 
FFFFFLNPLK 


7487 


21388 


A 


7547 


2 


348 


FCH VAQ S GLKLLAS SDPPTLAS QSAR I T 
GMSHHGGLK* GE FLGPSSNKGALLLWEA 
TWLQS FMKSASLFVKPRPTFVYFVFAFV 
G YRD I YF TAS S L I KV I QE GL KYTMKCAP 
LTL 


7488 


21389 


A 


7548 


2 


90 


RKQAYTHTNTHTHTHTHTH* KKTGIHAH 
KHTHTHTHTHTLKAEKQIEGGK 


7489 


21390 


A 


7549 


256 


405 


CELYSGKEMELVFGLFIFTGVTRLETFY 
KKE CS S FFHSQS RGHKLQPS ILNE W WH I 
RNPLGLTHFLLL* KTGVAE YKNSLNWH 
HPFFLSYAVSFLLQGWSFAMLSKLGSNS 
*VQARCLPQSPKVLGI,KESPQERTVNVS 
SIRGKKWSWYLDYLFSQGLQGLKLFIRK 
S VHHS S I PRAEGINCNHQY 


7490 


21391 


A 


7550 


2 


362 


ILIMNTLLALLLI I I TF*LPQLNGYVEK 
S TP YE CGFDP I S PARVPLC I KF FLAAI T 
FLiLYEIjE IALLLPLP*ALQTTNLPLIGM 
SSLLLIIILALSIiAYE*LQKGLD*AELK 
EEQKTLQC 


7491 


21392 


A 


7551 


1 


317 


TTFDNSALLFFWDGGEIEFHSVSQAGVQ 
WHNLCSL * PQPPGFKQKLI LPQ I VKENV 
SKISSQLLFSRVNINISPSEQCITPPSL 
QLARIGI FYAAKIHLTKGLRG 


7492 


21393 


A 


7552 


479 


183 


YLFHVHTTASHSNGACTGPT VGD 1 1 VS S 
TTi* SMTEQVTMPLASAVSDGTVPSVRTA 
SRGSEQAAESMVSSEHIEILEHAGELVI 
AS PEGQLE VQTVI V 


7493 


21394 


A 


7553 


454 


105 


KPPWGNKIRPPLFPGRKVPPPRF*NPVR 
GS PPKK* FVFPVGRVLGTGVTPLS PFLK 
TTPLLLWD P PLS Q P PGGVKPENS Li YPGK 
KRFR* PKFPPCPPAWGTKRKPPLQKKKK 
RQVL 


7494 


21395 


A 


7554 


439 


1 


LLKRCVRKDSPPPQNKIFFFILKKFVFF 
FAPY*VRKFYFLTAHFGKRPPQIYIFGP 
PPVFLIFCCFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFNKQVFI 
ERLLCARHCFRPPPEEKTKNKIEAANPN 
SEGEER 


7495 


21396 


A 


7555 


323 


82 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFR* *AYL 
VRS P LL F I F VF S Y F * L I * HVYHR YS VY I 
ELDVLLTGSKIILI 


7496 


21397 


A 


7556 


194 


412 


KMYRTS LGHLVE AKKKKKKKKKKKKKKK 
XKKKKKKKKKKKKKDSRGGVV * KKFGGG 
HNTRGVKI I FFFSLGG 


7497 


21398 


A 


7557 


414 


219 


KGFKKL.FPPPVF* FFLCPPPI FLKGFP * 
GRKPPPPKNPQGGPPFFFFQTPPPLFFF 
FFFFFFFF 


7498 


21399 


A 


7558 


377 


25 


THVGGVFWGKFFFSRRVFFFFYHLIQIL 
SPPPQKRPPPKGTPEG*IFPLFKEINFF 
FF*DFFFAPPPFFFFFFLFFFFFFFFFF 
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nucleotide deletion, \=possible 
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FFFFFFFFFFFFFFFFFFFFFLKVKTGV 














YCVPPIAVILYR 


7499 


21400 


A 


7559 


386 


2 


FFFFSFGEIFPPPGISGFFPPFPL*NFF 
FPLRPLFFLGGGAPFFPPPKKGFFPKIP 
RGVFFPPPLREKFFFSPPP*IWAPPGFF 
LKGPPLFFFFFFFFFFFFFFYRKARTKP 
I CLWGDVS PS KHFQCI 


7500 


21401 


A 


7560 


1 


396 


KI CHR ILRPI KKVK* E KKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKGGGAS 
LKIPSGGPQLPRGGBKNPPPFLGGEKKP 
PGVFLKKHLFLGGGHFGPPPPKKNSPWG 
EKKUFWGGGGKKKQNIPGGKKI PSHKKK 


7501 


21402 


A 


7561 


1 


390 


N YVFKE FSDKLGAVAYT CNP S TLGG * GL 
WIT* GQE FE T S LGNM VKP YL YQ KYQ KKK 
KKKKKKKKKGGGPFKKTLGGAQFNGGGE 
KKFFFFLGGVKKPPRGFLGKNP FLGGGK 
MGAP P P P K I KALGE KKNF 


7502 


21403 


A 


7562 


406 


182 


FVFFFFFXXXFXXXXFXFXXXF*TXXFF 
FFXXXXFFFFFFFFVCFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFF 


7503 


21404 


A 


7563 


2 


113 


FHFTSKHHFGFEAAA*YWHFVDW*LFL 
YVSIY**GS 


7504 


21405 


A 


7564 


313 


97 


S PT YP C YIRNS AQS FSL I AKS FCSHPHT 
YHHDPHLEKSLTYCPQS PS * P I IPPVTC 
TYINRWPEATEEPQNK 


7505 


21406 


A 


7565 


3 


144 


DAWVF VATG FHGLH VI IGSTFLTICFIR 
QLIFHFTSKHHFGFEAAA*YWHFVDGV* 
LFLYVSrY**GSTFLTICFIRQLlFHFT 
SKHHFGFEAAA 


7506 


21407 


A 


7566 


1 


130 


FIRQL I FHFTSKHHFGFEAAA* YWHFVD 
W*LFLYVSIY**GS 


7507 


21408 


A 


7567 


434 


1 


PDLNKVI FFPHRKKKEPYKCHI VGKNLR 
NEM Y S I QDTNSL PRRER * RKTRT C 1 1 F P 
HK I KTS EHS FD I KE KH*N YVTL CKVLRK 
LSTNQRQS * I FLS TNKNDTLGRARWLTP 
A I P AL WE AEAGG S * GQE I E T I LANKVKP 
RGRV 


7508 


21409 


A 


7568 


441 


86 


NFSFREKGGEFVPPPPLKIFFFPPPPNF 
FGGGGPPSPPPQKKVFFPKPPRGFFFPP 
PKKKKNFFPPPQKIGPPPSFF*TPPPPF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
MFVCLY 


7509 


21410 


A 


7569 


1 


155 


PTR P DLRS LRF E AKS PNSRRTLHKPGVT 
I * EAFASKRKVL I VEEPS INLE * LYDYG 
FFYYKIKEPVYFYSNQKKE 


7510 


21411 


A 


7570 


434 


1 


SPTEAGGKNFFKKNPGRKTPPKKKKNIF 
FSPLTPKKFFFPPRG* ILGGGGGPNCPP 
PKKGVFPKNPQGVFNTPPKKKKKKFSPP 
GKNGGPPGVFLKGPPPFFFFFFFFFFFF 
FFLRQSHTLiLPRLECSGA 


7511 


21412 


A 


7571 


493 


196 


S SLDCS * SJjQGS I * FIjTLMMIiAVDFVD V 
LY* V* RVLYVLVQVAI TKCHI VSGLNNR 
HS FLTAVKARKS K I KVP ADLVPGE VS L P 
SDFS 


7512 ' 


21413 


A I 


7572 


1 


239' 


■KGLT I LDAIKNI CDS * KGVKI L I FSEVW 
KKLVPTLTNDFERFNTS VE E VTAHWET 
ARELELEWPQDVTELPQSHDQT 


7513 


21414 


A 


7573 


391 


32 


F FPQ P S P PGVKGFF CLNLPNNWGFRAGP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GItttamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon y /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPPGYFVFLVNKGVFPVGPPGFELPSPS 
*VAPPASQKGGVSGVSPCPRVWVPFFKK 
KKKLRGVEQRCTMHLLVYSKNVPVKPQK 
KSSIGSL 


7514 


21415 


A 


7574 


2 


383 


PRVRSTAFSSIAHITRDVNYG* IIRYLH 
ANGAKKKKKKXKKKKKKKRGGALKKKPG 
GGQKKGGGEKKNFFFKGGEKKKPGGI FE 
KKTFFGGGENGAHPPQKKKTPGGKKKIL 
RGKGGKKPLYPWGEK 


7515 


21416 


A 


7575 


402 


69 


KI FFKNSPLWKI SHPPTYRGFFPPFPP* 
NFFFLPGPYFFLGGCSHLGPPPKKGFFQ 
KIPPGLFLSPPFKKKIYLFPPRIFLAPP 
GVFLKAPPPFFFFFFFFFFFFFFFFF 


7516 


21417 


A 


7576 


1 


119 


LI FHFTSKHHFGFEAAA* YWHFVNW* L 
FLYVSIY**GS 


7517 


21418 


A 


7577 


219 


69 


■ ND I SANTAI KKFWKLPKCPTTEGWLNT* 
LYIHLVEYYATLKVCGDLYVR 


7518 


21419 


A 


7578 


3 


287 


HAS AQLNGYI EKSTPYERGFDP I SPARV 
PFF I KFLLVAI T FLLLELE I ALLLP LP * 
ALQTTNLPLIVMPSLLLS I I LALRLAYE 
*LQKGLE*AE 


7519 


21420 


A 


7579 


10 


125 


ALQTTNLPLVWSSLLLIIILT*ALQTT 
NLPLWMSSLLLI I ILTLILAYEGLQKG 
LN*AE 


7520 


21421 


A 


7580 


3 


299 


DAWVRLLDFDF*VELPTEARIRIIITSQ 
DVLHS *ADPTLGLKTDAI PGRLMQTTFT 
ATRPGVYYGQCSEI CGANHS FMP IALEL 
VPLKI FE IGPVFTL 


7521 


21422 


A 


7581 


3 


116 


AFAS KRTGLGVEE P S LTLE * LGGGPP P * 
HTLEEPVNI 


7522 


21423 


A 


7582 


398 


2 


RVFPAPKNSLQIFPPLFFFWGSPWKFFF 
TPFPHFVFWFGNFKFNPFFAFLFRLPFP 
GEKIPLVFFFFFTD*NFVENFFFFLSLL 
FFFGFFFFFNM* S * LF*SPANFNV* VTH 
LF I FMVFFHPDFWS FGPILW 


7523 


21424 


A 


7583 


288 


9 


RGSKNLEKNSFPYKVNNHGNGSPLFPHL 
FFSPRGVGPFFFFFETRSRSVIQAGVQW 
HDLSSPPGLK*SSHLSIQSSWDY*RAPP 
HPANFVYF 


7524 


21425 


A 


7584 


1 


124 


RDGGFTMLPRPILLTGLKQSSCISLPK* 
WDYRCEPLHLASC 


7525 


21426 


A 


7585 


339 


76 


THPGHNG ISVSP KKKKI PRGGGP PPLFP 
PLPRVGGGDSL* PRKGGFN*TKPAPFPS 
S WGKKKE L PF S KKKKKKKRKKGS I S T P F 
SRIE 


7526 


21427 


A 


7586 


205 


87 


SISLSLSLSHTHTHTHTHTHTLSHPRCV 
PQPIGLPISKWAKVKELIT*R 


7527 


21428 


A 


7587 


353 


256 


PHPSVTTLVTSQQYQDPPPAKRL*LAEG 
SDDC 


7528 


21429 


A 


7588 


159 


195 


QICRHRWHINFLMSMGSIMITPPSFLF 
LNFL* S YVAQAGL KHL S QS D P PAS AS Q S 
AEIRGMSHNAWPHF* YFVELVHFI YINK 
FVGIELFILTS 


7529 


21430 


A 


7589 


76 


205 


RKKPE P EKTCFDN I TGNTYP CLSTYEHD 
SD*YYWRT*LCAER 


7530 


21431 


A 


7590 


218 


20 


LSLYLASHLWLKPPHPLEWAPSKSSLRL 
GT VAS P CNPS TLGGRGGWVT * GQE S ETS 
LANMAKPCFY 
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peptide 
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Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7531 


21432 


A 


7591 


142 


12 


Q IDYYYYLF I *RKGLTLMLRLVSNSWLH 
I ILLPWPPKVLGLQM 


7532 


21433 


A 


7592 


2 


224 


Q I IEALL I TILLGHYFTLLQA* E YSKAP 
FTISDGI YGSTFFVTTGFHGLHGI IGST 
FLTICYNPQLIFHFTSEHKFGLQAAP*Y 
WLFVDVARLFVPAS I Y* *GSLFYPPTSL 
RVLQG S LHHFRRDLRLiN I F CNHRL PRT P 
RHYWLNVPHYLLQPPTNISLYLRT 


7533 


21434 


A 


7593 


1 


323 


HAYGNTLVA* LWTMTF * LPLVDG YI KK 
FTPYEGGFEPKSAARVLFS IKVFLVAIT 
FLLSDLQSALLLPLP*AVQTTNLPLIVM 
AALLLD 1 1 LALSIiAYA* LLKGLE 


7534 


21435 


A 


7594 


1 


104 


RACVCVCGVCVCVL CLLKMCVCVG WC V 
CVFCAC * KMCVCVC WCVCVFCAC 


7535 


21436 


A 


7595 


383 


43 


VFISETLCEHVAWSQTTIESLGLGTVAH 
TLiNAS TLRG* GRM I A * GQ * S KTRRGNMD 
SISSHSQT I AQ S PHLLGRHQ P KW CMC VQ 
AAEAVIiKGQVETDASFQIQKIPINSLIN 
L 


7536 


21437 


A 


7596 


2 


331 


WPPTGITPLNPLEGPLLNTGGLLA*GVS 
MT* AHHS LGENNRNQVI QALVI T I LLGL 
Y FTLLRASE YFE S P FT I SDGVDRS TFF V 
TTGFHGLHVIIGSTIiLSIWFIRQLICNF 
TS KHH FGF KAVA * Y WLNFT KYLVH P PTN 
M 


7537 


21438 


A 


7597 


384 


23 


LFKTKQYRFLLPPFPLKIFFFPliKA*IF 
LGGFSHYFPPPKKGFPPKSPPWF*IPPP 
YRKKHFCSHTPKNLAPRWFF* KRPPHFF 
IFFF* *RWGLiAMLPRLFSNS *AQAVLLL 
QSPEVLGL 


7538 


21439 


A 


7598 


401 


189 


RVLPCFAGWS*TPGLKCPTCLGL*KCWD 
YRREPPCLIRFYLFVYFLRQGIiAMLPSIi 
VLNS WP * VTLLP * S P 


7539 


21440 


A 


7599 


81 


368 


KCVT YPFLS ITLGKAKYDFFFFLEKELA 
FFPQGEMQGGNLG*LKPPPLR*RDFSCL 
AL PRGGNGRGAP P S P TNFG FLGGNGV Y P 
SGPGGFETPDLK 


7540 


21441 


A 


7600 


264 


25 


AVEHPQLRLFCSHY*NNIVEERLGVLAR 
TCNPCPliEDKAAWI T*RHEFETSLANMV 
IFCLDIGSKSFLSMRFSRCLLWKL 


7541 


21442 


A 


7601 


1 


196 


WEPSLVGETNVNSFNQKYINWPGAVAHT 
YNAGTLGGQGGW IT* GQELE I TLS KKVK 
PSTYFTNTR 


7542 


21443 


A 


7602 


315 


82 


TCTQVFIALFITAKKFKLICPLTDNWIS 
RVWYVHTMKHISAI KRKAV I HATMW VKL 
ENIILSEIKQSQKATGCMISFI 


7543 


21444 


A 


7603 


334 


118 


P KI KT E GGP P PKKNAGGM YK * KKF I L VL 
AAPPHPPFGVFSVFSFYSFWFFRFFFFF 
FFFFFFFFFFFFFFLR 


7544 


21445 


A 


7604 


3 


288 


DAYTT*NILSSVGSFISLTAVILIIFMI 
* E ASS S S S S P PGGLRPRAFP VRPPGAGL 
VF*VSVPPDLPQCSLAGLQVLRFMELAR 
LAPHIRWIIQ 


7545 


21446 


A 


7605 


354 


33 


EGRARYTRVPQRPRFHLLGIFFAPQLGK 
KKKFSFSKKKKKKNQLWLGAVSHANNLS 
TKGG*GRGSRFT*A* EFRTSQKLGTKGN 
MVKPISTNTFKNERGGAELSQL 


7546 


21447 


A 


7606 


401 


161 


YLVSHLLQI ILPPALF WIFFF*DRVLL 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, Y=possible 
nucleotide insertion 














CGPGVUS A WWS YCS LCLLGSNS S PCQS L 
PSSWDYRHLLWRFLMPRGILCVYV 


7547 


21448 


A 


7607 


354 


50 


S GAQD PAG QHGQ T P F S QKVP KKVYPAW W 
CM WP AT Q E A* LKKS LE P WRRRFLLRE A 
FFVYLG VWBSNGEP P PLGGGGS VFLEAIi 
EKSVGFSSLGFGWVFFF 


7548 


21449 


A 


7608 


370 


115 


T I KN I CNS WE VKI PRFTGVWKKLj I PATL 
MDDLEGFKTLVEEPTEDW* ITGELEVE 
PENVIEFLQSHDQTSTDEGLLLMDEQRR 
W 


7549 


21450 


A 


7609 


120 


358 


TTAVYLGH YCQ S P * I LNGTVNT * PPWH 
KDPILFLTPFPCFHASSAMNLQIiSHISC 
NS KATPHP LGYQQT Y P PLTVHMT 


7550 


21451 


A 


7610 


3 


164 


HE* AS IP*NTHTHTHTHTHTHTHTHFYK 
EENLSHLWNI SC I FLGAHKGKKKMN 


7551 


21452 


A 


7611 


85 


324 


FRFFAFFFF * KKI S FCPQAGGQGGILPS 
LDPPPPGLKKFSGLTPPRSWN*GPCPPP 
GLIGVFIjKKRGFPIjVGREGPNLWT 


7552 


21453 


A 


7612 


268 


317 


FL.ENCFNPGGGGCSQPKLGPCPPAWGAK 
*ASLSKKKKNKNKN*LSYKAPSYGYKGK 


7553 


21454 


A 


7613 


101 


339 


AVPLTMVKIHALWKRWRFHNKSKL*LP 
CDPAISL.VSMYPKEMKSVCLKDVCSPRL 
I TGPLT I AGM WNP PNRSSMD YS G 


7554 


21455 


A 


7614 


366 


208 


LELLTSGDLPASASQSAGITGISHRARP 
GTLFF*AVNGC^NQVFLFLRVIiNGL 


7555 


21456 


A 


7615 


30 


328 

• 


NYCLDFHGETELGTTHFLTHPSTRAMVF 
EPYCQAIi* I Li * GV I NT * PT VLHRNRS H I 
KAACLCLQARTWNLQ1SHINWKY*TTP 
HPL*YQQTYPALEYX 


7556 


21457 


A 


7616 


39 


159 


TPGLKQ F FHLGLPRCWD YRHE P PRPCPN 
MYIjI*ISF*CV 


7557 


21458 


A 


7617 


290 


91 


KMSFLIFFRDMASLCCPGWT*TPGLKQS 
SCFHLPRSWDHRCTSLHLAVKTFLYYFL 
KMFYYQNVWH 


7558 


21459 


A 


7618 


37 


246 


T Q QEL VY I YR Y I DTHTHTHTHTHTHTHS 
HTH YW I YT W KHTL KG P KLF * E KT KMGS T 
P I ISWKERLSLYRS 


7559 


21460 


A 


7619 


23 


190 


IPPQPCPFLLSLFCT*IDMSCNMCACVC 
VCVCVYVCVICDMVLLPFCFCKLECSK 


7560 


21461 


A 


7620 


367 


2 


FFSPKALIFLGGGGPIFPPPKKSFFSKI 
PPGGFFPLPGFLKPPPPPLF*NPPLKKK 
NFFFPPPENWAPPRVFFKGPPPFFFFFF 
FFFFFFFWSSRFLIFPQFFFLNNKIGKK 
KIFRHLGAG 


7561 


21462 


A 


7621 


6 


25 


SVILQSIIIF I ATTNLLGLLPHSLTPTT 
QLS INLAMAI PL* AGAE I IGFRSKI KNA 
LAHFLPQGTPTPLIPILVIIETISLLIQ 
PIALAERLTANITAGHLLiXlHLIGSTTLA 
ISTI*LLYYNP 


7562 


21463 


A 


7622 


383 


67 


AFLNPGKKP F P PG F KRF F C P P P P KKWGK 
NRGPPPRGKI FFFFFF V KTKFFHVGPGG 
F* ILTSGDPPGQQGGLGGFFFFFQIGGT 
GDLNLSHKS PMPFT INTHNKG 


7563 


21464 


A 


7623 


372 


35 


TGGFWGVFFFFFFKQGFPFKSTGGNLGP 
KNPCPKGPRVSPPPPPF*NRGPQGGPPP 
PGKKFPF FFFFFF I ETGFPHVALAGLEIi 
PGSGNLPTSVSQSAGIPGISTWPNGVFQ 


7564 


21465 


A 


7624 


287 


6 


LSGVEWVDHMINICLFFKKLPPCV*KWL 
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peptide 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, LHLeucine, M— Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possi ble 
nucleotide insertion 














YQP VAVAHACNP S TLGGRDRWI PGGQE F 
KTTCLSLPKRWAHRREPLYIiAFCPFYNG 
IGCLLIIGF 


7565 


21466 


A 


7625 


271 


277 


NRGKGKTGSLGILEKNPFFGGGPFWANP 
P P * GL KFWGGVG P KR P P P KKRVF FQNS Q 
GTSFPLPPVLKSGPGLVLKKAPQKEKNI 
TFQPPGKFGPPRGSLKRAPPFFFFFFFS 
LRKSSVIEK 


7566 


21467 


A 


7626 


2 


269 


LLGELSVLVC* FPPKHTTFLLTGLS ILF 
TGLYSLYILTTTQWGSLTHHIINLKPLF 
T P KNTLMF I HL Y P ILLLS LNPD 1 1 KKKK 
KGGAF 


7567 


21468 


A 


7627 


397 


164 


FPPPKKVFFQKNPKKKFSPPRVF*TPPP 
PPFFLPPPKKKKFFFSPPP*IWPPPGIF 
*KPPPLFFFFFFFFFFFFFFK 


7568 


21469 


A 


7628 


3 


324 


DGMNVSTLYGATGSHGLHDIMGSALLTI 
WYIRQLILHFTSKHHFGFEAAA*YWHFV 
DVD * LFLCGS I Y * * GS PGG P KFSRGGKG 
IAVPHWGDD 


7569 


21470 


A 


7629 


2 


246 


SGLGTTLS * KNI S VLLTGLKI IjVTGLYS 
VYI FTATQWGS LTHHINKIKP * LTRENT 
LMFIHLCPIIiLLSLNPDISYPGFTS 


7570 


21471 


A 


7630 


112 


290 


LI SGKGQWTQMRIiPVLVTLFTDSFSK* K 
NPSTLGEQCGRIT*GQEIiETSLGNIVRP 
HLY 


7571 


21472 


A 


7631 


248 


2 


KVPASGKRPEPHWYPYHAVPGCGRIPAT 
AIWTQKWYQGFPPALSPRLECSGAISAH 
CYFGFPGPSDS PASPS * LAGTTGTRP 


7572 


21473 


A 


7632 


383 


106 


GWGQCFKPAI PALERPNPQECFRPGVLN 
QP * PQNKSPFFFFFFKKKKKYI YICMVV 
HAYNPSYSGG*GGRIT*AQKFKTSLNNT 
VRPHLYKK 


7573 


21474 


A 


7633 


435 


133 


YACLGLPKCWDSRHEPPGLANFCIFSKN 
RGFPISDRWINSWPRVFG* PKPPKMGG 
FQTLVQGASLGFLLIFLKTGFPFFSQHE 
EQGWDTS S FQHRFFRGK 


7574 


21475 


A 


7634 


1 


209 


NAYRIVILCQKLFPLLSGKMNFKKTRCW 
bGMVAHACSPSTLGG*GGRIA*AQEFGT 
SLGS ILRPCHCKK 


7575 


21476 


A 


7635 


117 


495 


STSFIDKVQLRHSFLFFSLFFETKSCSL 
TQTGVQWHDFGSL*PPPPRFK*FSHLGP 
LSSWDYRHAPPRPASFFVFLVEMRFHHV 
GQAGLE L LT S GDP PAS AS QS AG I TGMS H 
HALAHLYFSSGKS 


7576 


21477 


A 


7636 


1 


241 


LPGSSDSPTSASQVAGITGMCHHAQLFF 
FLAF * L I FNF * FFFGNS FFFVPQAGGHW 
GIGG* LNPPPPGLKRFSCLTLWGG 


7577 


21478 


A 


7637 


3 


93 


EAEAGESLEPGDRGCSKLRS CHYTPAWQ 
QSVTLVSKKETKNTKTSEVP FGVLNLLY 
KVS INI L I FKE I FYQAPL I LV* DNPDS S 
TLSSALQPGQQSKTVSNKTKQKTVS* PG 
NRV 


7578 


21479 


A 


7638 


46 


228 


GIPHIQPSKSVFNNVHGPGTVAHTRNPN 
TLEY*GGSIT*GHEFETSLGNIVRLPPP 
SLQSI 


7579 


21480 


A 


7639 


1212 


1023 


KNHSNEWIKKM*YICTMEY*AIRRNEIM 
AFAAT*MKLETIILSEVTQEWKTKYCMF 
SLVSGS 



1016 



WO 01/64835 



PCT7US0 1/04927 



SEQ1D 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQIB 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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Amino acid sequence (A=AIamne 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F— Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7580 


21481 


A 


7640 


479 


318 


AEFLNLGDGGCSEPR*HHCTPAWATRAK 
FHLKRKKK 


7581 


21482 


A 


7641 


470 


172 


RNIRPRENQYPVFIWAFFLIAKNW*QPK 
CP S S EE AMNKLRC I HTTAVLHS S EEEQ P 
TDTHNNSDLRS IAPS WRSQTRAAAHRMI 
PWHSGKGKTPSTENA 


7582 


21483 


A 


7642 


2 - 


201 


AGAPPPAWLPPCRLISDOASNQRDSVG 
VGPS EP YATVRKYLELLLCQMHHDMCT Y 
RFSIRIVLNL 


7583 


21484 


A 


7643 


521 


385 


GGSPLETH*YTSQGGGIPYTDLTGHHPS 
QGRIQEAPKLTHL 


7584 


21485 


A 


7644 


331 


3 


DLVPKKGGGGKKTTPRIKKFPPKGEDFF 
FFTPHYFWFWVFFWGGAFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFT*DYYIS 
F*EERFNKFYRCQSTLVQIGHTSNL 


7585 


21486 


A 


7645 


88 


367 


HCS FLLLGKKKKKKKKKKKKKKKKKKKK 
KKKKT KKNE Y KNLIiGGAP P * KPPSGALY 
KYRRENFIPFSFERVSKYPPRDLCKKNL 
FWGGLNIPS 


7586 


21487 


A 


7646 


378 


37 


FFFPXFFFFFSPPPXFFFLFXFFXXPFF 
XXSFFSLPPLPLXXPPPPPPLFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFNI*TI 
VL 


7587 


21488 


A 


7647 


34 


293 


TGIPGVTSAIFMI* EAFVS KRKVLI GKK 
PSINLE*LYGCPPPYHTFEEPVYIKSTQ 
KRKESNPPKLGSSQPHGLHDFFKEKKKI 
IK 


7588 


21489 


A 


7648 


22 


106 


KNFLPIiTIjTLL*KNFLiPLTLTLIjI*YGS 
IPITISSIPPQT 


7589 


21490 


A 


7649 


1 


231 


FDRT ISKDISK* MVLVSKKKKKKQKKKK 
KKKKKKKKKKKKGGGPFKKTPGGAPKWP 
GEEKI I FFFLGGGKKNTWGFF 


7590 


21491 


A 


7650 


79 


286 


WSRCLVSLRNPCI I KTCS * AKKKKKKK 
KKKKKKKKKKKWGGAFLKIPWGGPIFPG 
GGKFFFFFFGGGY 


7591 


21492 


A 


7651 


57 • 


375 


SGKSEFHRVPQWPGTGADACMPSTLGGR 
GRW IT* GQEFETS PQGDP I S TKKKKKKR 
GALLKDSLGGPNLPGFGNLKVFS FRGG I 
liKPTWEFWEGTFILGGEKIGPN 


7592 


21493 


A 


7652 


3 


357 


IiAFLLALSKITHASIPIPVSSPSKSPRS 
KGTKKKKKKKKKKKKRGGPLKKTLGGPK 
INGGKKKKI FFFKGGEKKTPGGI LEKKL 
FLGGGKMGPNPPKKI KPLGEKKNF *"GEK 
GEKKP 


7593 


21494 


A 


7653 


113 


378 


MGAFNLGPFIjLMGGGNPSGPNGHWVLGA 
GSYLGPGKTPPRKSQKGGPPPDL*GRGG 
QCSPGTKPGGGGEKTGLAPPFTKSPLGF 

lqkk 


7594 


21495 


A 


7654 


341 


147 


NFFFSLKPFIFFGGFCPIFPPPKKSFFS 
KIPPWFFSPPF*EKIFFFPPPLNFAPP 
RVFFKGPP 


7595 


21496 


A 


7655 


395 


3 


FFFFFFFFFFFFRGKSE*FVLFPVSPAP 
SLRG 


7596 


21497 


A 


7656 


2 


162 


ESKGYTHTHTHTHIYTHIYTHTYTPREH 
* PKKADVAMLILNKIiFFKPRCVALL 


7597 


21498 


A 


7657 


333 


28 


LEFFFNFFFS YFLFFS *RVWGPERPPPP 
LKKAV*KFFLFDIFIiFFFFFIFGLGFFF 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F~Phenylalanine, 
G=Glycine, H=Histidine> I=Isoleucine, 
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FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLEH I RLLMF VTFS S HR 


7598 


21499 


A 


7658 


1 


137 


HSSLGDRVRPCLKKKKKKKEKKKKKKKK 
KKKKKKKKKKKKGGGL F * E KKKKKKKKK 
KKKKKKKKKKKGGGLF 


7599 


21500 


A 


7659 


236 


32 


1 1 FL I KFLYF* F I F I * F * FFLTFF YFKR 
LKILFFFFFFFFFFFFFFFFFF*KNFIK 
NI I YFFI FYLFYYLKFFI YIKTFLFLKI 


7600 


21501 


A 


7660 


301 


65 


KAPPKIFWFFFFVFFFFFVSFYIFFFFF 
FLFFFFFFFFFFFFFFFFFFFFFFFWKS 
LAAT*FLRGFETF*LAYILKTT 


7601 


21502 


A 


7661 


70 


273 


KKKKKKKKKKKKKKKKKGGGALI KNPWG 
GPIHRGGEKFFFSFLRRD 


7602 


21503 


A 


7662 


1 


353 


ILI INTLDALLLI I ITFGLPQLSGCLQK 
STPYEGGSDPISPARGRFSLKGPLQAMT 
FLL*ELEIALL*PLP*ALQTTNLPLIDM 
GCLLLNIILCLSLAYE*LP*GLDCSRYH 
SQR1Q 


7603 


21504 


A 


7663 


1 


131 


F I RQL I FHFTSKHHFGFEAAA* YWHFVD 
VA*LFLYVSIY* *gs 


7604 


21505 


A 


7664 


2 


192 


PLSQSLY*LLAADLLILT*IGGQPVSYP 
FTIIGQVASVLYFTTILILlPTISIilEN 
KIIiKWAY 


7605 


21506 


A 


7665 


356 


2 


TFFFLSSPGGSCGSPTPARPKKNPPPWG 
GGLPLFSRRGALFPKNFFWGGYPFFFLF 
*KKPPFFPPPGPQSPVTSPKDWPPLRM 
PPPRPHVRPLGLPKKSFSSPRWEKQVKK 
TKKRAA 


7606 


21507 


A 


7666 


312 


2 


GFFFFFFFFFFFL*DRVLLLLPRLECNG 
VISAH 


7607 


21508 


A 


7667 


1 


233 


FWLS SNAE FDSANS CE CLEVQRM I QDQD 
SFPTYHYFDMYVCIYV**RSLAVSPGLV 
SNSWPQAILQPQPPQSLGLQE 


7608 


21509 


A 


7668 


174 


295 


IFFLPFCGEHSLAVLPRLVLNSWTQAIL 
PPLPP*ILGLQA 


7609 


21510 


A 


7669 


183 


330 


NKFKLYWS P VAMAHACNPRTHRGGW I T* 
CEEFETTIANMVNMVKPCLY 


7610 


21511 


A 


7670 


3 


319 


TSNTLLALLL 1 1 1 TF * LPQLNG Y IEKST 
P YE SGFD PIS P ARVP FY I KF FL VAX T FL 
LFDLE I ALLLPLP * AVQTTNLPL I VMSS 
LLLI I ILPLALAYE * LQKGLD 


7611 


21512 


A 


7671 


1 


313 


ARGERERERERERERERERERESGGGGP 
TQTDCKGGRNT*RGREIYRESE*DDRPP 
FL P T YRVNLQRP VGLRRL KGAGDKT FCL 
ILTLARDYVWPDYRMKRADHM 


7612 


21513 


A 


7672 


3 


91 


TRRERERDR * RERERERERE I FRE KNSQ 
S 


7613 


21514 


A 


7673 


1 


115 


DELANLF II* KAGFPVLPRLVLNS WLQV 
ILLPWPPRLN 


7614 


21515 


A 


7674 


3 


232 


TRRERERERERERERERERERERERERE 
RYRE*DRERERERERDCVWGGGVISLSR 
AVALSGAP AVGRQTRER I S R 


7615 


21516 


A 


7675 


1 


234 


ARGERERERERERERERERERERERERE 
RE IERERERLTQRERERERGRGVWHAPL 
A I E RG * KP QGGFWWKE E RVKKE P LGKLL 


7616 


21517 


A 


7676 


1 


262 


ARGERERERERERERERERERERERERE 
RERERERERERERERARAPQNL* ERWAH 



1018 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 
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TPPAPGVSLSWREHTRGGPPWGKKEGGW 
GRLGH P P P S I LRAKGG 


7617 


21518 


A 


7677 


214 


2 


QQAS VAQAGVKWPDLCSLQSMP PRFK* F 
SCLSLLSSWDYGPPPPCAGLHIWRRNVT 
LALQGLPTTSLALV 


7618 


21519 


A 


7678 


3 


341 


HERHELEELIKTLFFFFFFRKKNHLCKP 
GGKKRGPIWFLKKFFSPPQKKPPPPPPF 
LREIGAMAPQPGGGLLI*NKTPFRPRGQ 
GGPKPPTPGKPNPPPPPKGNFPL*GPNP 
QPPGSPTPPPPQKVIFPFEGHQA 


7619 


21520 


A 


7679 


1 


357 


GTR INTLLALLLI 1 1 TF * LPQLTGYI KK 
S TP YE CGFDP I S PARVPFS I KFFLVAI T 
FLLLDLE IALLLPLP * ALQTTNLPL IDM 
SSLLLIIILTLSLAYE*LQKGLD*TDFY 
SVGGEAN 


7620 


21521 


A 


7680 


365 


1 


PDASQ*HCTNGQSNRLLNLLIKALLTTA 
KIWIQPKCPPVDARIKKMQ*IPRMEHPS 
AIKKEILPSVATWMSLEDTVPSRISQAQ 
KDKYHT VS CMWNLRMLNAE P EGRP WPR 
GVQGWGAARA 


7621 


21522 


A 


7681 , 


387 


3 


FFFKKEFCPRKKAKK*NWGPGNLGPRG* 
KNFPPPPPQKRGKKGGPPQGGEILGFL* 
KKGFP PMGRGGS KLWPLGFS PLNPPKRG 
GKKGEPPPPPQRGKPGGPPPKGLFFFFF 
FFLRQSLTLSPRLLV 


7622 


21523 


A 


7682 


107 


290 


ELNKRWGLGAVAHACNPST* *ELNKRWG 
LGAVAHACNP S TLGG * GRW IT* ALE F K I 
NLDNKARSHLKI KKQHQQQKNSGLDA 


7623 


21524 


A 


7683 


1 


348 


GTRERKPTWLWYHRERGMERDCS CVPGS 
SGISESRVWVQVGMEVYPAI I YLCLEPK 
YL V I S E FS P AFR I W F LGLS WAHT CNP S 
NLGGQGGRIT*SQEFKTNLANIVRPHLF 
FFLF 


7624 


21525 


A 


7684 


364 


77 


GGPSIFPRLALLGGGGPGGPPLPRGNFS 
FFNFFVKKGVFFPPGVFPPLPPGVLPPP 
PPFWAPG* PRPPNFFLGAPGFFPFFFLG 
GGFFPVAPGGV 


7625 


21526 


A 


7685 


3 


24 


HEET 1 1 QVKREPVE * KKI FANP T FNKGL 
TSEIYKKQLNRKKPNNPI*KQSSK 


7626 


21527 


A 


7686 


1 


125 


GTRPGMPGTYSNYPDTYTA* KIISS IGS 
FISKTAFQPGPWE 


7627 


21528 


A 


7687 


459 


298 


LSLPSSWNHMCALPHPASFFL*RQGLAL 
L P RLVS KS WAQVML L P W P SKVLGLQ V 


7628 


21529 


A 


7688 


374 


248 


FPHHNVHE VI VRPLS PRYP 1 1 SHVTCTY 
RSRWP*ASEGSQKK 


7629 


21530 


A 


7689 


284 


163 


GTVAHTCNLSTLRGRGGWIT*GQEIDTS 
PAWPPKVLGLQA 


7630 ; 


21531 


A 


7690 


83 


255 


KVDYVS IKSEFF* DRVS LYHPS * GTVAR 
S*LTTTPNPGIKQSTCPTVPSSWDYRHV 
LPLPTRTIFSVPTQSIA.TILI * ITLNLY 
INLGKIHFL 


7631 


21532 


A 


7691 


3 


193 


HERLDPADFS FNFFS VDVGLAMFPRLVL 
NS *HQVILLLWPLTVLDYRHELLRPAWG 
FYYLNFT 


7632 


21533 


A 


7692 


1 


364 


NFKSFFP * LF YL YHLLFFGFGLFLFFL I 
KLGSSLF INFS KKSVL * FTNWI YFS VL Y 
FTDFCFI FI ISFFGFSLI YFI IAVFPLK 
FIGFILSFI*ITVFYGQAQWLTPVIPIL 
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WVAEVRGSL 


7633 


21534 


A 


7693 


370 


151 


GKKYSVASPGKLWPPQGFFKTAPPFFFF 
FFLK*RCGFTMLPRLVLNPWPQGILPPQ 
P PKVLGLQ VLRHH I WLR 


7634 


21535 


A 


7694 


3 


373 


WPGYTLNQAYAKIHFTI IFIGANLTFFP 
QHFLGLSGMPRRYSDYPDAYTT*NI LS S 
AGS F I S I TAVIL 1 1 FM I * E AFAS KRKVL 
IVEEPSQSAGITGVSHCARAEYLFIDRR 
DGLSLCWPGWT 


7635 


21536 


A 


7695 


406 


394 


KFF*KKFLFPPTPTFLLEACLFFFLPPI 
KGFLFKYIAPGIKTPPPKKEKFFSSLKV 
LFS P P YFFF I P P P P I FFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFRFEG 
ECWRL*W*FYFYFFKESPHCIP 


7636 


21537 


A 


7696 


2 


334 


ILINNTFLALLLLIITFWPQLKGYIKK 
STP YECGFDP I S PAR VP FS I KFFLAAI T 
FLLFDLE I ALLLPLP * ALQTTNLPL I VM 
ASLLLIIILALSLAYE*LQKGLD*AE 


7637 


21538 


A 


7697 


2 


95 


LNLTIYIILTTTAFLLLNLNSSTTTLLL 
SRT*NKLT*LTIYI ILTTTAFLLLNLNS 
STTTiiLLSRT 


7638 


21539 


A 


7698 


423 


326 


RQCLTMLPRLVSNS*SQVILLPWPLKVL 
GSQA 


7639 


21540 


A 


7699 


400 


32 


FHKEYESYFFSPNQPQFFFFFPHR*NFY 
VGVLKKQ P P KKKFF FLLFTPER FFF P FL 
*KKNIFFFP*YFFFPLVIFFL*PPPLFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFSSNSFPFY 


7640 


21541 


A 


7700 


125 


407 


FINFSSTFWKPTTCNMQKHTPIARTKD 
LCITIFFFFLEKNFLFVPQVGGQGGILG 
*LKILLRG*KQFSCFTLPGRWTNGGIPP 
PPKNFLiKKFF 


7641 


21542 


A 


7701 


50 


225 


P WNM VKMS C WLGKVAH VCNP S TLGGRGR 
RIA*AQEFETS*VTQ*DPVSVIKKRKKN 
FK 


7642 


21543 


A 


7702 


2 


146 


EI ALLLPLP * ALQTTNLPL I VMSSLLL I 
I ILALS LAYE * LQKGLD *AE 


7643 


21544 


A 


7703 


1 


93 


riillNTLIiALLLLIMTMGLPQPKGYIKK 
S TP * ECGFDP I FPARVPFS I KI FLIAIT 
FLLFDLETALLLPLP * ALQTTNLPLIVM 
AS L LL I I I LS LS LAYE * LQ KGLD * AL LL 
LIMTMGLPQPKGYI KKSTP 


7644 


21545 


A 


7704 


2 


112 


GRVGKHHFGFEAAA* YWHFVDVG* LFLY 
VSIY**GS 


7645 


21546 


A 


7705 


1 


325 


TAGQFLPKLSILLSYNPAITFLGIYPKI 
LKTYVYMKTCTWMFIAALFIWQTWKQQ 
SKLWYIQTIKYYSVLK*NELSSYENTSK 
KLRCI LLRERSQYKKPPYFL I PTM 


7646 


21547 


A 


7706 


2 


169 


SRSRAGTLAI*TINLPCTLMIFTILIVL 
T I LE I AGAL IQ AYVF T L WS L YLHD S T 


/OH-/ 






77H7 






MT.TfvnrDT CT T .^Tr'XTTT .Q.V"n'C'X^Q.\TTtr\r*r~ { '^T 
l v iJ_iXl\ V Ir -LAjJ-ii-ir ± ijo is.cj aro V1J V 

QWINLFSMQPPPTGFK* FSCLKA* ASDN 
LSPHEQYRLALSFLKLTL 


7648 


21549 


A 


7708 


343 


163 


PKEF* * RQGFFHVGQAGFKLLRS GNPP P 
SASQNGKITGVNPLAWQTNNS IPPMAP 


7649 


21550 


A 


7709 


459 


144 


F WPGAS SHAFDP TTLGGRGGR I A * AHE F 
KTSLGNIVRPPSDTCNPKVLGLLA 


7650 


21551 


A 


7710 


2 


83 


NFLPLTLALLI*YVSIPITISSIPPQT 
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7651 


21552 


A 


7711 


19 


181 


KYVHTKACT * MF I S SLFI I TKNWKQP I F 
SSIGE*VNKT*YIHTMEYLLFRNKRK 


7652 


21553 


A 


7712 


392 


49 


SDIFNSDIFSHY* CSVF*S YLAYNRYLS 
FL*LFLEFIYSYP*IDHAFHCFLHPVSS 
FWLDFFLFIFSFFLjFSYFFSYFYF*FYF 
*FIFNFIFNLFFILFYF*FSFFSKCLKF 
PC 


7653 


21554 


A 


7713 


334 


208 


F F F * E W VLT TL PRL VLNPQAP AI LL P W P 
PKVLGFTGVSHIKI 


7654 


21555 


A 


7714 


2 


408 


WNGME WNGL KWNRL ER YGMQ WNGLE WNR 
TE * TRMVWNAMEWTR1 ELNGLE WNGME C 
NGMHLNGLE WNGME RNGTD P E WNGTD P E 
WNGTDPEWNGTEWNGMEWNEVDSNGMEW 
NG I DSNEMGWTRME WNGMES TRV 


7655 


21556 


A 


7715 


362 


223 


VWWFMPIIPTLWEAEVGG*LEPRSSKPA 
WATWQDAIFKKKFKHQHP 


7656 


21557 


A 


7716 


2 


335 


LI VI INTLLALLL 1 1 ITF* LPQLNGY IE 
KSTP YECGFDP I S PARVPFS I KI FLVAI 
TFLLFDLEIALLLPLP*ALQTTNLPLIA 
MS SLLL 1 1 1 LALS IxAYE * LQKGLiD * AE 


7657 


21558 


A 


7717 


2 


142 


SITL* LPQLNGYI EKTTP YE CGFEP I CP 
ARVPLS I KFFLE AITFLLFDLE I ALLLP 
LP*ALQTDDLPIiIVMTSLLLIIILTLSL 
AYE*IjQKGSD*AYMPRPRPFVHKILLRS 
YYLLII 


7658 


21559 


A 


7718 


6 


92 


F FE AAA*" YWHF V1STW * L FL Y VS I Y* * GS 


7659 


21560 


A 


7719 


417 


164 


PLEPLISGRGLPQIAPPPKKGSPPKSPR 
WFFFPPP*KKKNYFPPPPKILPPPGFF* 
NPPPPFFFFFFFFFFFIFFYFFKTALPL 


7660 


21561 


A 


7720 


25 


401 


THNS * DPNWD * I PHYA* P * T S TVKKKKK 
KKKKKKKKKKKKKKGGGL I KKFRGGP KY 
TGGGKI IFFFFMGGKKKPLGDFLKKNFF 
LGGGNLGKHP P KKLSLQKKKKNFKGRGG 
KKTPLCRRGKKFS 


7661 


21562 


A 


7721 


388 


2 


APPFFF YFFFF I FFFYFFLGVGFS FK * N 
SKVFWISNFLKKILRVFICSLEKGINPF 
LEKCF YMFFFFVAF I IRLNFSLQAKFFL 
LNFALLiPFPEIFFFFPFFFFFPQIEDFW 
HPYIEQVYQLSLFSLS 


7662 


21563 


A 


7722 


389 


71 


FFLPHQKQVFFPPPPFRIFFFFSRVFFF 
WGGWAQKPPPPKKFFFLKTPPGFFLPPP 
*KKKFFFFSPRFFLPPPGFFFKPPPPNF 
FFFFFFFFFFFFFFFFFFFFLN 


7663 


21564 


A 


7723 


77 


220 


TPRRGWAYWLMPAYPAILEA*AGGLLEP 
KSLR S T WP TWRNS ISPKRK 


7664 


21565 


A 


7724 


16 


258 


ISSFVFKRRSFAMFPRLVLNSWPRWLP 
PRPPE * LGLQACTTTFS FNFFDG I FLR P 
LKEDVTARDQRE ILYVFALFYYGGA 


7665 


21566 


A 


7725 


353 


111 


IFGAGPRFVFKAGGQGANSRSKECPFPPG 
NKWSFPFSPSKKWEGRG*PPRPPFFFFF 


7666 


21567 


A 


7726 


2 


185 


MSMGHTRLSSAWTGKPPLSVEDDFEKLI 
WEISGGKLEAEIDLDPGKDEDDLLLELS 
*MIDT 


7667 


21568 


A 


7727 


346 


143 


SQAFLYIiESFLLLIiLIjY*FIiFFEMESHS 
VTQDR VQWRHL S S LQ PLP PR FQQPS RL T 
LPSSWEAEVGG 


7668 


21569 


A 


7728 


411 


221 


NLRPPGSS YSPASAS * IAGI PGARHHTQ 
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LIFAFLAE ILKNLAKLYL I PRKPNI YYL 
PNCYINI 


7669 


21570 


A 


7729 


2 


303 


AAASTNMLLFFLGRHLWEWLGHMVGLI 
FTFKKLPNQAGRRGSHL* FQHFGRQRWE 
DHLRSRVQDHPEQHSKTPSSLQKNKMSI 
ESNLLNIHKLFSTGRM 


7670 


21571 


A 


7730 


67 


65 


S ICEYCFVSSVFVLKRLFSVMQAFSLEG 
CVS CQS * AFLQH YL FKKL YFLY * WGLTM 
L S RL VLNT WPQ V I HLi P WP P KVLGL * AF C 
GNEA 


7671 


21572 


A 


7731 


354 


238 


QS ILDDVAMVLD * QAEVF I VKMWRLL I Y 
VHEAKKIGLVK 


7672 


21573 


A 


7732 


33 


186 


RGALAVLSRLILTPGLKESSCLDLPKGW 
DYRWE PPRPGCFF * LMVLVLS F 


7673 


21574 


A 


7733 


1 


181 


VNAGADCS S IGGVP FLQHKKCHGKDYE P 
RGITTLERS YVEETTEHLVSKSK* PLRA 
QINL 


7674 


21575 


A 


7734 


336 


80 


HKLKEPPGVFPVFPFKNLEFGPGPPKFF 
FKKAFSPILSFFFGKIFKIPRFRGENFA 
P*NFGKNS*KPRFSPPPPKKKGFFFFFF 
F 


7675 


21576 


A 


7735 


2 


163 


TPVSTGTPVPTLT*VPSPIIFPVSEKWA 
GCLHLCLNFTCTELRLLTSLLTIRS 


7676 


21577 


A 


7736 


87 


258 


KAPSVCLFSALLMLLRMSARTSVCTVRP 
LSPS*AVISPVTCTYTSRWPEATEDSQK 


7677 


21578 


A 


7737 


1 


103 


LDS KG I YMTLQ P S KLE P KLE ANVE I RE H 
MLENSSRF * RDLYD S AAFKART KARS KC 
RDKRAHVGEFF 


7678 


21579 


A 


7738 


193 


300 


GGLPPPPFFFFFFLKKRGFPWVTRGGPN 
PPPLEMGGPTPPKGWNYGGGPPP*FHPF 
GGVGPPISKGGGLGPPRVTQGKPLFFKK 
KKKKNGGGGKPP YS PFLGGVKQKKGVNP 
KGGGSKKPKFPPPPPPGGKKKKPFLKKK 
KKKK 


7679 


21580 


A 


7739 


150 


17 


GRVAQ VWWLMHAI PAL * KAEAGRSLEPR 
SWTPDWGTWHLPISTK 


7680 


21581 


A 


7740 


1 


358 


SPPRPPPPPPPPPKEKFKKLENPPPPPK 
NPQKQNPPPKTKKKSPTTTPKKKPKIGT 
PPQTPKKINCS*KKKNNPPENKKGLNYP 
NKPNHPHAPPKEPKQREKPLVPTKKKKG 
PSPKKKK 


7681 


21582 


A 


7741 


235 


147 


F*FFLGDEISLCCPGWSLTPGLEQASCP 
GLPRRWNYRCEPPHSVKTIF 


7682 


21583 


A 


7742 


131 


12 


AASTYGQ* FKICGAILRLMPI VIEFIPL 
KILQIGPVFTL 


7683 


21584 


A 


7743 


1 


184 


NQYPWCFCNHMSMGKES KTLNRSGMVAH 
TCNPS TLGGQGGR I T * A* GSKSS LGDKV 
RPHLY 


7684 


21585 


A 


7744 


347 


202 


SLFVETGYHYVD*AYLELLASGSPPASA 
SQSFGISSVShCTQPMFQFE 


7685 


21586 


A 


7745 


138 


263 


KKRKT Y IHTKTCM * MF I AALF I RVRKWK 
QS KCHSADEW INKF * Y I KI CLS I HQLND 
IWIASTF 


7686 


21587 


A 


7746 


358 


169 


AAAVAERTQNTEKTEDLVLGLWIKKVIY 
WPGWAHVCNPITLGGRGGWIT*GQEIE 
TSLPTRR 


7687 


21588 


A 


7747 


78 


346 


AI QMLRNQVKI KLPKS P VE 1 1 PFKIVPQ 
FQVQKI KSQNNFLLIiFQE SQ I FFATS TG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=GIutamine, R=Arginine 7 S=Serine, 
T=Threonine, V^Valine, 
W— Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SVRP LS PS * AI I S P VTCT YTSRWPE VTE 
KSQKK 


7688 


21589 


A 


7748 


222 


98 


KKKARHGLGAVAHACNP S TLGGQGGR I T 
*VQEANIVRTCLY 


7689 


21590 


A 


7749 


314 


52 


LNISTVD LQ FFLG P RKKKRISf LND S S LF VS A 
EEVRLMYCFNKLAILHYF* FSNKFI VFV 
*KFGHLLDENMGS KFDNIGMNAMANKDN 
ASK 


7690 


21591 


A 


7750 


371 


216 


YHTHTHTHTHTHTQ I P S LPLNLEGP VGY 
LLSCGSLHLKNLW* KCMRKERKK 


7691 


21592 


A 


7751 


3 


238 


FLIQTGFGHVSQAGWELLR*GNLPALAF 
QSAGITGMSYLCVAESLYLLPPFLKILC 
SSLLTLS PYDTWKVALLCSYTV 


7692 


21593 


A 


7752 


2 


247 


NSDIFDIYIIQYIFLIYLIYIIQYIYLI 
*VFLIIFFKFTLK*EFIFISLSISFRLA 
FIVYRDVILLFLFFFS*NFVGDSIS 


7693 


21594 


A 


7753 


359 


2 


TLKIRPLLYLPLFFFFFF* KKKIFFFPP 
GGGKGV* SGFYLMGRGTFSLFFKRFFLF 
YPPKEGGLRGSPPPPSFFFFFFKKRGFP 
FLARGV* KKGPQGTPLPWLPKSLGLRG 


7694 


21595 


A 


7754 


3 


174 


CSVIQAGGQWCGHSSLQP*IPGINQSSY 
LSFPSSWDYRHATMPS*FLFFIEIKMLQ 


7695 


21596 


A 


7755 


292 


219 


IWPLSHVHSNPLTHSPSQVDQGLFTTLL 
PCLS G YMYVQ F P LiAEMP F I S S LLLENAC 
LSLCEGSEAQKPLLWEGLSYLPPLNQLP 
LCL* LCVCI S GGVCRCLMC I LGVCVCMH 
VCVS VCP CTHTLAQETKGC 


7696 


21597 


A 


7756 


2 


195 


VALMADGAIDTESNDYGAFMPLGIERGL 
DRIWEMPELWLRPNEFDCMTD* PCIQHA 
PSVSCGLiA 


7697 


21598 


A 


7757 


4 


280 


DHIVDMITPSFTRRTIAVF*DLNLYIVI 
RGH I TS I LKPNKNLKLW Y I YT I EYYSAL 
KGNE ILIHAT I WINLENMQDE INQTQKD 
IYCTISLI 


7698 


21599 


A 


7758 


352 


107 


FLTRAS RGHNGET I VTS WPGAVAHVCNP 
S S LGGRRRW I T * GQE FKT S LANM V I T F F 
I SKNSNRFLSQFP I S LRPTHYKVLS 


7699 


21600 


A 


7759 


92 


4 


RPS PRRGGWTA* GQE FKT S LGNMVKP CL 
Y 


7700 


21601* 


A 


7760 


92 


4 


RPRRRRGGWTA* GQE FKT S LGNMVKP CL 
Y 


7701 


21602 


A 


7761 


328 


124 


PLFSFLFFPFFFFFFFFFSRRDGVLPCC 
PGWS * T PGLKLS SSL CL P KC WN Y KHE P L 
HTAHHNF 


7702 


21603 


A 


7762 


1 


167 


VDSTDKRPGAVAHNYNPSTVGSQGGQ I T 
* GKFNP S LANT VKL F KKKKKRKRGAL F 


7703 


21604 


A 


7763 


309 


226 


RPRRRLTLSPRL* CSGAISAHCKLRLPG 


7704 


21605 


A 


7764 


349 


7 


QVFCFLF FVCFS S I FMTFNE * QGLKVTS 
G ISSNVY* * FLL I LGKI T YQD IGRCFS 
EYGSPEQHNLGIVRNASSRLGWAHTCN 
P I TLGG* GGWIT * GEGFETSLANMGKPL 
LY 


7705 


21606 


A 


7765 


43 


339 


FFFFLFFFFFFI IFLLLSFFF*DHFFVS 
LLFI SNFVFLFP I F I FTVFQAHLF I CLL 
FFPICMFLNLFFPNFNYYFYPFPLFFYL 
LFFFFG * FFNHL FYL 


7706 


21607 


A 


7766 


352 


102 


HTSCLIIKSSLSKMNISCVSSSISFFF* 
FI LIE IGSR * WQAGLQLLGS TDPPAS A 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ifig to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F— Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V— Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














SQSARITGLSHHAQLSSTSLVVENVEW 


7707 


21608 


A 


7767 


346 


90 


TPPSSFFSFPPFPFFSFIPFRFFFFSNR 
DRVLVCYLGWS *TPELK*SSQFGLPKCW 
DFRCGPPCLTSTAFYIS ITVNFSPSNCY 
S 


7708 


21609 


A 


7768 


278 


87 


AASTGVGPSEPSAGYYLLVCRLLRSLEK 
CSIRVGVT*FSRCHLSPLSLTRKGNSLT 
PCASRVR 


7709 


21610 


A 


7769 


5 


272 


AVYHP INES CVLNI GKDS CLLY * LFEKK 
KKKKKKKKNKKKKKKKNKHYE* FNTFPG 
VARKINLFLCIQKVDAGGGTLWGGGPPI 
FFFFL 


7710 


21611 


A 


7770 


198 


33 


QRPQSRQCRAWLGMVAHAYNPGTLGG*D 
RRTT*GQEFKTSLGNIVRPYLYEKMF 


7711 


21612 


A 


7771 


97 


410 


A*WLMPVIPIIWEVKVVRLLEPGSLRPS 
WATWRKP 


7712 


21613 


A 


7772 


5 


415 


I LCVYLHFVGTHP FVHL * YMH FLPLTLA 
LLI*YVSIPITISSIPPYT 


7713 


21614 


A 


7773 


1 


342 


WRVT S GHS G * AAAYAT * Y I L S AEGS F F 
PLTEELLI*FMKREAFT*KR*VLIIHEP 
S INLD * LYGCPTP YHTFPDPVYLNSKRR 
RRD S H P P KLAS RL PHVLHD F F KKKKGGR 
RF 


7714 


21615 


A 


7774 


398 


122 


SPPPPFPSSPSFFLPPPSSSFFLFFLFF 
FFFFFFFFFFFFFFFFFFFFCRD*VSIiC 
CPA*SQTPGLK*YSYHGLPKFWNYWHEP 
PCLAYFL 


7715 


21616 


A 


7775 


1 


336 


FNFLLI 1 1 IEMESRS VSAHCKLSLPGS S 
NSSASASRVAGSRGAHHHTWLFFFFFFF 
F * KKKGAP PGGQGG FLTLKKKNP P PRAP 
QRGGINGGTPRPQNGGGKFFPWGEKKKF 


7716 


21617 


A 


7776 


1456 


1232 


FAIESHGVTQAGVQWCNLASLQPSPTEF 
K*FSCLGLPSSWDYRCVPPHPANFYIFS 
RDRVSPCWPGWSKTPDLK 


7717 


21618 


A 


7777 


400 


2 


NTEFPYGPPISLLGM*P*EMETYIPRKP 
CTQMFLEVLFTIAKK* KKPKRS * LING* 
ILVYPLMEYYSATKRFDVMIHTICNVDAI 
GNI ILSERRQM * KTHIVYS IHRAS CTAE 
VRIKVSYKRAATWIKSILIA 


7718 


21619 


A 


7778 


360 


218 


PYLANF*NLL*RWHLNMLPRPDSNSWPQ 
VI LSLWP PRVLG * QRGGRVEQRGARGNW 
MS KCPEGGDCLCWVAGA 


7719 


21620 


A 


7779 


1 


372 


FEVRSCSVAHAGVQWHDHRGFEFIiGSSD 
DPS VSAS CI SRI TGASHCTQG *L* LLTK 
VL*VSAFCLSVNTLRLTSLLKSDTFLGS 
RSKLNSLGESLQTYISPSRHFIQLLCFR 
SliFPRFIiVKHKK 


7720 


21621 


A 


7780 


372 


219 


ATSLLSYFKKLPQPPQSSATTTIISQ*S 
STSRQDLPPAKRLELTEGSDDP 


7721 


21622 


A 


7781 


358 


145 


I KNIHD S WED VK I S TLTR V* KE LIP VL T 
DNS EGLKTSM* EVTAE WE I ARELELEV 
EPKDVLVQFHDTINR 


7722 


21623 


A 


7782 


317 


146 


GRVDCKCPFYRFQMLLVRLGMVAHVCNP 
STLGGRGGWIP*GQEFKASVSRMPRPHL 
Y 


7723 


21624 


A 


7783 


13 


186 


DRVS I T Q T S WAH PPT S AS * VAWTTGMHH 
HTWLNFVFFVEMGFHHVGQVG IRLPLLS 
WK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H~Histidine, I=Isoleucine, 
K— Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=TJnknown, *=Stop codon,/=possib!e 
nucleotide deletion, \~possible 
nucleotide insertion 


7724 


21625 


A 


7784 


339 


221 


KWD LAMLARLVSNS WL QGS S YL S L * NCW 
DYGSEPLCPAL 


7725 


21626 


A 


7785 


355 


3 


FGNSGRVDLFFFFFFLDRVSLCCPGCSA 
VTNLEVHHSALQPQTPGLKPSSHLSLLS 
SWDYRRVPPHPS*GLF**GK*YSITQLR 
FLNLHLESGLRSLLPKPVASTVRIRTQS 
GLGLK 


7726 


21627 


A 


7786 


1 


201 


QQFSHVFRLLFQFSFSCQEPPSSHIQPG 
VCTRNTKVSQAW* CVPVI PANREAETGE 
SLEPGRRSTRP 


7727 


21628 


A 


7787 


270 


1 


RPRRHLVPCVLMLGPKCPGLPHTLTTLP 
YTLTTPSPTRSLGPPECK*SFHLSFPSS 
WD YRRMPQRLANFCRGR I LLC YPDWS S S 
PGLKQS 


7728 


21629 


A 


7788 


229 


2 


DGWVRINWAQPGTVAHAYN I SNLGGRGR 
RIT*GQEFKTSLTNMEKPCLYSDHEVRS 
LRPARPTWRNPISTKNTKM 


7729 


21630 


A 


7789 


118 


263 


CSKSYYWPDAWHTCNPSTLGGQDGWIA 
*AQEFKTSLGNIARPPISVF 


7730 


21631 


A 


7790 


246 


359 


TFI FSETRSHS VTQAGV* WHDHGSLQP * 
PPWAQVILSC 


7731 


21632 


A 


7791 


70 


102 


AA*SRLTATSLSRVQAIPLPQPPEQLGG 
QGGRIS 


7732 


21633 


A 


7792 


7 


312 


FLDFQLRLHSNS YCEDKGVFDS E I YE I C 
IVLICKK*RKFLNQKKKKKKKEKPGLI * 
KKKKNSQKKWFDFPWLPPNPETKNSLFL 
PKEFLWLKTKPPFPLTLT 


7733 


21634 


A 


7793 


1 


341 


XKPE * YLLFAYT ILRSVPNKLGGVLALL 
LSILILAI IP ILHI SKRRS I IFRPLSRS 
LY*LLAADLLILT* IGGQPARYPFTI IG 
QVASVLYFTTILILIPTISLSENKVLLW 
A 


7734 


21635 


A 


7794 


1 


143 


YGSTFFVATGFHGLHVIIGSTFLTICFI 
RQL I FHFTSKHHFGFEAAA* YWHFVDW 
* LFLYVS I Y* *GSTFFVATGFHGLHVI I 
GSTFLTI CFIRQLI FHFTSKHHFGFEAA 
A 


7735 


21636 


A 


7795 


2 


257 


KWAI IEEFTKNNSL IIPTI IATI TLLISTL 
YFYLRLIYSTSITLLPISNNVKIK*QFE 
HTKPTP FLPTL IALTTLLLP ISPFILII 
L 


7736 


21637 


A 


7796 


22 


326 


RDASDCSFQNLPVPLWEK*MVFLLTKK 
KKKKKKKKKKKKKKKKKGGG P F KKNS WG 
AKIIPGKKKI IFFFLEGQKKKLWGFFKK 
KPLFWGGKKRPNPPKKN 


7737 


21638 


A 


7797 


506 


281 


RGAN*NRSGCGKRHEERERERERERERE 
RERERERERERESPRPPKRQRERErEIQ 
TLSLTVSLAPPPTCVF 


7738 


21639 


A 


7798 


435 


336 


MRSNIHFFSHTHTHTHTHTHTF* ILKQH 
TFSK 


/ /39 


21640 


A 


7799 


411 


106 


RJNlPPPLFFSPPLQRGGFPPIYWGPPRFF 
PPPPL*KTPPKLKIGAPPKKKKPPPPPG 
EKMVSF*PPPPFFFFFFFLRRPFPLFSP 
GGGPWGHFRGPPPPPPGV 


7740 


21641 


A 


7800 


2 


296 


F VPSTANWCFFLWS VF * TGGSNYFFVHR 
S YSQAGVDL I FIRLANT I SNLS F I * * RG 
GLAMLPRLVLETRAQTILHSWPPKVLGL 
QVWATVPSFQFLKN 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=AIanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


VSSJS 


nucleotide 


nucle- 


E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, HNBistidine, I=IsoIeucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lrysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "—Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7741 


21642 


A 


7801 


344 


105 


PGFFFFPPPGKRGFFPKPFFLGTPGFFP 
P PLF*NPAP * FFFWAPKKKI FS P PPPGK 
KIFLLKGPPLFFFFFFFFFFFFF 


7742 


21643 


A 


7802 


68 


216 


VHTIAKIWKQPQCPSMDD* IKNMRYLYT 
VE * Y S VLKERE I LLY WGHYAM 


7743 


21644 


A 


7803 


407 


3 


GFIH*FPLFSGYTLDQTYAKIHFTIIFI 
GVNLAFFPQHFLGLSGMPRRYSDYPDAY 
PT*NILSSVGSFISLTAVILIIFMI*EA 
FASKRKVLIVEEPS INLEWLYGCPPPYH 
TFEEAVYI KSRQKRKESNPPMY 


7744 


21645 


A 


7804 


2 


156 


THRRIIILSQGL*TLLPLIAF**LLASL 
ANKKKKKKKRGGPFKKFPWGAKI 


7745 


21646 


A 


7805 


2 


309 


THSRI I ILSQGLQTLLPLIAF* * LLASL 
ANKKKKKKKGGAPLKKIPGGAKI KRGME 
I KNFS QKGGEKKTQRGIFGKKP YhGGGQ 
NWE KL PKKFKGLKGKKNF 


7746 


21647 


A 


7806 


110 


362 


TLSDLERLLMKAVSHFLM I DLLE KC * C V 
LKNQS SKKKKKKKKKKKKKKKKKKKKKK 
KKKKKGGPLLKNSWGGPNFPGGEKXFFF 
FFRGGF 


7747 


21648 


A 


7807 


295 


108 


KQRGF VFFFFEME S RS WQARWQWCNL 
SSL * PPPLGFKQFS CLRI VYRKDSLS FN 
KPRLLP 


7748 


21649 


A 


7808 


2 


142 


GSTFFVATGFHGLHVIIGSTFLTICFIR 
QLIFHFTS KHHFGFEAAA* YWHFVDVV* 
LFLYVSIY**GSTFFVATGFHGLHVIIG 
STFLTICFIRQLIFHFTSKHHFGFEAAA 


7749 


21650 


A 


7809 


390 


2 


PASLLHCG*ISDCCASNE*GSVGLGPSE 
PGAGYNLLVCHLLRPLEKCS IRVGVTRF 
SRCCLSPLPLDRKGNSLTPCTSQVRQCL 
ALLQLTLGALHPVS CTHCPT I SGEMNP V 
SQLEMQKSPIFCVTHAG 


7750 


21651 


A 


7810 


1 


157 


FLHFGQAALELLTSGDPPASAS *SAGIT 
GVSHRAQL VCTF I T I YVFLKNS S Y 


7751 


21652 


A 


7811 


368 


44 


QNFPPPKKRI1PPQPPI1FFFI1 VPS FKGEG 
FFFFLRGFFFFSSAQRALL*FIYFIFFF 
FFFSFFFFFLYFSISPDSKGDTHHDLLL 
GVSWWTRSLPLWIARHNMHKWGW 


7752 


21653 


A 


7812 


400 


80 


PQNSFSPPGIGGFFPPFPL*NFFFSPKA 
FFFLGGFSPFFPPPKKSFFFKIPPGFFF 
SPPLKKKIFFFPPPFFIAPPRFFFKAPP 
PFFFFFFFFFFFFFFFKYFRHI 


7753 


21654 


A 


7813 


119 


230 


SEEFETSLRCIVIPSL*KKKKKKKKKKK 
KKKKKKKKKKKKK 


7754 


21655 


A 


7814 


3 


412 


HEQEHL * LLLP * * PLAI I * VI STI^AETN 
RTPFDLAEGETELDSGFNIEYAAGPFAL 
LFIGEYTDIIIRNTLTTTIFLGTTYDAL 
SPELYTTYFVTKTLLLTSLFL*MRTGYP 
RFRYDQL IHLL * KNFLPLSIiALL I 


7755 


21656 


A 


7815 


1 


183 


LRLERLSEFSTRRERERERERERERERD 
EFSKR 


7756 


21657 


A 


7816 


484 


97 


QPRTPDLK* FACLGLPKCWDYRHKPPCL 
ASDGNHS * SSPGLLVLSSASVSPSGHMS 
PSQQTSPWSEESLLLGRVPGFLYFPPV 
DAPGPGAWIiAHWQAVLKLLGSSDPPVL 
PSQSAGFAGLSNCAWPW 


7757 


21658 


A 


7817 


2 


219 


ADRLRNS * EA* EREREREREREREREP Y 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=L»ysine, L=Leucine, M=Methionine, 
N— Asparagine, P=Proline, 
Q = GIutamine, R=Arginine ? S=Serine, 
T=Threonine, V=VaIine, 
W—Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPSHIYTDRNRH*MRHEERTLFLWGSGE 

^7T?F>Ii A DTAn/CJWTDKTT'ZiT, 
V i\iJ±^t\ir /V V orl J. JriV L I-\±J 


7758 


21659 


A 


7818 


190 


292 


WSHLIPPWTKVFFMS*CTYNLVLPGSEK 


7759 


21660 


A 


7819 


498 


139 


RRAATPAPAAAEPPPPLQRP* PPSEPPD 
TQRDARGRRRGE D PGNS P FH PNPDRQ P S 

DflT (^CT'DDMHTD T r.7 T ["* ID 7\ TPTPT R/TT»C3T OTTO/"* 

AVI PRNGGPGGVSSGPCLLQDLCGQAGS 


7760 


21661 


A 


7820 


489 


330 


ELGFLHVPQAGBELLS* SNPPASASLPT 
SWDYRHDHRTGHHPGIYDSKMCIF 


7761 


21662 


A 


7821 


490 


260 


FFLXXNXFXFFFFXXFXXPXFXXXFXFF 
r Jf aaaI? r r Ar r r v s c Ar rriJrcrrrrir 
FFFFFFFFFFFFFFFFFFFLY 


7762 


21663 


A 


7822 


493 


482 


\ t/~i r\i\r~*T vdt tcptit t~>~a c a cnc auttpt/o 
Vt^yAljijKKij J. fciljrJJijlr'AcsAcjy £>-A.V X lCjVb 

HRARPIMYFRYVQ*AKGSHV*RWY 


7763 


21664 


A 


7823 


456 


121 


ASFFPIQYRGLGGATPPQEGGCG*GIPI 
KWGYKRRPGGPHGGSKRPPTQ*KKSYFI 
NVLVL F YLRDKGLAI PTLVSNS WAQVI L 
PPWPPKVLGLQTRVTVPGQNIL.KENMFF 


7764 


21665 


A 


7824 


294 


457 


LCIGFVILISYFNIMENWYCRPGTVAHT 

L-XSJPi? 1 J_iC3(j*C^Kyj. IKisKUKiJiiPGyHG 


7765 


21666 


A 


7825 


26 


235 


S VWWNS QTGNKVKFT I S W * IDKQMRCIH 
LMEYYSAIKRNEVDINATTTWINLNTIK 
LSKRSQTLKSNTL 


7766 


21667 


A 


7826 


30 


303 


S YVS WE FPGS S G P TL P S P WHQ TPRT YL 
NS VASAINLTQCPH * PEKKKKKKKKKKK 
KKKKKKKKKKKKKKGGAF KKKGP * KT P K 
KTPGGF 


7767 


21668 


A 


7827 


464 


21 


nTTnr\TTm\ m\r\TT tt TTTir.TnT tit *i* tit tdtttt 

REPPHPAPADILIIjHFWPLEL*ELISVV 
LNTRCWICDDTPRKLTCQPMPRLLLQT 

T7TiT T 7T "COTVTT O "C^"* T C V O -Jr ft/I/^ T T TMTJW 

iliPLi* Viijf oWijCjrorGibijbJlib ^iXlGJjVWJr r C 
KGTDSRSFSLFVAVFCLFL*RWGLAMLH 
RLVLNCWPQVILPPPPPKLLGLQVEATT 

rxHij?xHj 1 ill 


7768 


21669 


A 


7828 


372 


205 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFIKKFFI 


7769 


21670 


A 


7829 


463 


313 


DIYPERRNSKIMSFAATWMKLEAIILK* 

i_ii\x\ KxvFIn J. v I ri J_i 


7770 


21671 


A 


7830 


449 


266 


WQMRFLHVGQAGL * LLPSGDPRASQVQ 
SAWI TGVSHHAQRS I FFKKRNNDCDRPD 

JN 1 Xi V* J. 


7771 


21672 


A 


7831 


480 


148 


IELRLSHGAAEFHRPREWYGIFKVLKER 
DFYPRI I YPAKI ILRHEGEMKTFPDKQT 
LRHFINTRPVLQEMLiKKVHQSEIKGC* * 


7772 


21673 


A 


7832 


23 


131 


QRGNS KG YHLKM I Q Q EE I RKLEE E KNQL 
EGEIIHFYKMKAASEALQTQLSTDTKKD 
KHGKKQ*FL» 


7773 


21674 


A 


7833 


16 


339 


ATI 1 '1 >rpT HOT M7\ PrTinf^T 7MV/TT T1DT T7>T lirnri T 

JM 1 D 1 LiLitj ijiyJAl? ^K.JJVjr LiAMli PK.li fi LiN i Gxj 
KRSS CPDLPNCWDYRHGPPYLAS FVI.TiK* 
* TL S I LLLi P Y YH KMHTCVFMVL CTNF CG 
G VC P GME LLCL KVYG YLS LKHT FH 


7774 


21675 


A 


7834 


355 


70 


KKTPRGFSGVFLGPFYKKGPLFFFFFFF 
FFFFFFFFFFFFFFFLFLQFFILCQQFL 
S *STEKTVHSKYILSFPFHCLlFCPFIIi 
LCSTFCNGAHY 


7775 


21676 


A 


7835 


494 


282 


PENGM I QGGGACSE PR* * HCTPAWAKE R 
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ed end 
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otide 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A-Afanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=L*ysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Prolme, 
Q=Glutamine, R=Argin ine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan ? Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














DS VSNAKTI QNKKNLGN *HSIS TAKAKN 
HIS* YKSELRKKM 


7776 


21677 


A 


7836 


305 


185 


LVIETCLTVRPLSPS*AIISPVTCTYTF 
RWPEVTEESQKK 


7777 


21678 


A 


7837 


179 


191 


LGCRKYLFYLNGGIiKCLYIKCVCVCVCI 
Y IG V Y I GMC V Y I C I YI YLMAS * NQVT S F 
DSFYVFLLNYR* I PVKVITVLFNLRMTG 
LjP 


7778 


21679 


A 


7838 


466 


343 


LGRLR QENHL P P EGRG CS E PR S RP C TPA 
WETEEDAVSKINK* INK 


7779 


21680 


A 


7839 


451 


89 


LRDTR* SGVI GLPKQAWAMVGAPPPASL 
PPCSLISDCCASNQRDSVGVGPSQPGGG 
YNLLVRRFLSLSEKRSIRVGVNRFSRCH 
LSPLSLTRKGNSLTPCASRVRQCLALLR 
LAHGARTH 


7780 


21681 


A 


7840 


442 


164 


AHTNQFSQCIKKSVTPDLMEEMYKKAHA 
AI*ENPVYEKRPKKEVKKNRWNCPKMSL 
AQKKHQVA* KKAS S LRAQ E Q AAE S 


7781 


21682 


A 


7841 


494 


359 


ICMLPRLVLNS*PQVILLPWPPKVLGLQ 
A 


7782 


21683 


A 


7842 


577 


224 


IFFFFNKIFLFRPGLNLMGDIWVPSTPP 
LQG*TNFPPQPSQEVGLQKAPPPPGLIIi 
VFLSRQGFPQLGRVGFHFPPPGDPPPRP 
FKKIWSHPGGGKSPPKKKKKKE IAEVLG 
AFRLR 


7783 


21684 


A 


7843 


404 


56 


HTLSTNVCSSYLNLDFFSLRGRGLIMLP 
RLALNSWAQEILLPQPPE*LQLLRRLRQ 
ENLLS PGVQGQPGQHNETPTS 


7784 


21685 


A 


7844 


1 


323 


INTLLALLLI I ITF*LPQGIGYINNSTP 
YEGGFDP I STARVPFCIKFFLEAITFLL 
FDLE I ALLLPLP * ALQTTNL PL I AMAS L 
LIiIII LALSLAYE * LQKGLD * AE 


7785 


21686 


A 


7845 


419 


159 


FLFFFFXFXXYVLSCGIY* YWCNLFSS * 
QYFFFYVMAFRIFLFTFLFLHFYYNMF* 
CTYFYLS CL* FEYICSHIIHYF I YFLHL 
ENF 


7786 


21687 


A 


7846 


475 


124 


FFFFFQPPQKEGGPPPPPKNFFFFFTRI 
FFFGVFFFFFPPQKNFFFF*KTKGVFFF 
PFF* *KIFFFFPGVFFFFPVFFFLSPPP 
SSSFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLKRGQAS 


7787 


21688 


A 


7847 


6 


189 


L IDT VS P S VALAGVQWCNHS S PQP * P PS 
LKRSSRLALPRFWDYRCAPLCKAHSFNS 
NHEK 


7788 


21689 


A 


7848 


20 


249 


KINFIWELTCSNTVHTFYVYGFDKCIL 
PTTQLFFFL*ETGFHSVARLECSGVIMA 
HCSLDILGSSWRPANFLNFL 


7789 


21690 


A 


7849 


101 


398 


LFQKKKKKKKKKKKKKGGPLKKKPGGGK 
NKGGEKKKIFFLKGGEKKNPRGNFKKKT 
F FGGGKKGEKP P KKKKS PE E KKKF * RGK 
GGKKS L I C WVE KNLG 


7790 


21691 


A 


7850 


87 


226 


GSLSPI MLiVLP FL I HLL* KNFLPLTLAL 
LI*YVSIPITISSIPPQT 


7791 


21692 


A 


7851 


2 


440 


GAI I RRLLH * F P L F S G YTLHQT YAKRH f 
T I I F IGVNLTFLPQHFLGLSGMPRRYSD 
YPDAYTT *NILS S VGS F I SLTAAVLTTL 
MS*EAFASKRKVLIiVEEPS IDLERLYGC 
PPPYHTFEEPVYI I SRPRRKESNPAKLV 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W— tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion^ \=possible 
nucleotide insertion 














SSQPHG 


7792 


21693 


A 


7852 


1 


80 


SSLLLI I ILALSLAYS*LQKGLD*AE 


7793 


21694 


A 


7853 


5 


135 


ATFLYFS*K*SLSMLPRLVLNSWTQAIL 
PPWTLKVGLCFFDPL 


7794 


21695 


A 


7854 


38 


419 


FVMMPIiHSSLGEGVRPYLKKKKKKPPWE 
GGSPPKNFKNQIFA*NQKPPFFFFIPPN 
KKWGSPPGGGASPL*SHPSGGPGGPNNK 
VKISKPPCPPGGTPVFTKTPTFT*PWGP 
I PEKEGRKTVCPRTH 


7795 


21696 


A 


7855 


398 


61 


FFSPRPRGGGFPPPPPKNFFFSPPTTFF 
GGGVWNFFPPPKMVFFLKISPWFFFPL 
IRKKIFNLSPGSFWSPQGFFF*GPPPKF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


7796 


21697 


A 


7856 


1 


121 


LDASGRLPPSPKPIKIKNYNRP*EPK*T 
KICSLHSLPPQS 


7797 


21698 


A 


7857 


1 


332 


TPPIARPAENSARGAEGRRR* I IALQVC 
NKKW P P T KLHC I KKKKKKKKKKKKKKKK 
KKKKKKKKKKKLWGGGGGKNFFKKGGKK 
KLGGGFKKRGRKKKK 


7798 


21699 


A 


7858 


3 


191 


SPRHLTRAAGIRHEGQTGLKLLTSGNLP 
AS AS * SAG I TGMS HR I G P RMLDL LTL VS 
WKQHGL 


7799 


21700 


A 


7859 


503 


62 


FFFFLNCCLSGLRHLIKLFEPQFLHL*N 
GGMVLS I ILS IKSPLRSPSLS YS WL* *P 
REKIKWWGGQVRWFMP INPRTLGGGGGQ 
IA*SQEFKSSLDNMVRPYLiSEPP 


7800 


21701 


A 


7860 


355 


86 


Y I KG * QM VGNRGAS QAGMTG YGMPRL IL 


7801 


21702 


A 


7861 


336 


225 


PQLAGPVKTVRKVYKKEKARVI PEEKKN 
FKAFAS FRKGRANGRVFGIRAKKAKEAA 
KQD V* KKP P I PLAGGS P TRE P P S QAHL F 
PQNPPRGNPP 


7802 


21703 


A 


7862 


465 


170 


GGPPPKTHPFGGERGGKYHRGKKKPPGH 
QRVNPPSPKKKKKNWGRGKNPFTPLIGG 
GQAKKPPQPGRGECP*KKVPPNNSVLGK 
KKNFFPKKKKKKKK 


7803 


21704 


A 


7863 


495 


275 


RS FFFFFFFFFFFFFFFFLVARK * YYFL 
AT KKKKKKKKKKKKKKKK 


7804 


21705 


A 


7864 


493 


70 


PLTAS I SLPVNWRY I APHRWKI K* KS V 
Y* KYLA* CLAQSKCPKSVCSHLSS P * LH 
FVPPPNPAEAMWTARQC* CIRGLGGKGT 
L C VAHT YNP S CS GG * GRR I V * AQE F KAS 
LGNSEPLILKKERKKESQAWWLIPIIPA 
F 


7805 


21706 


A 


7865 


67 


385 


VARTTGMQHHAWL IFFIFII FVETGSHY 
VAQTSLEFLASSDPFTSVPQSTWITGVS 
YSWS*PGIAHTCNPSALGDRGERIT*GQ 
EFKRSLGNIVRPCLYKNNKNKKN 


7806 


21707 


A 


7866 


454 


0 


LSFFFFPLGDRVTLCCPG*SAVARSQLT 
VALNSWAQAILLPQLHKHVPPCIAHF*N 
F F * RQR YTMLHRLVLNS WAQ V I LQ P W P T 
CPTAAQA 


7807 


21708 


A 


7867 


1619 


698 


P AT S S S S S S S S S S S AAAA VAAAAT AAS L 
PPCRPPLRPPQPLAPHEVATLAPWRG*K 
PSFTSSGKRWLTSEPQTSGSCASCL*SM 
RASSPSSG* SKKKPPLPAEAAASVAACA 
VCWRVRAPPYVAATTAYTMAVMGWMAFP 
WEVIWTRWRMMSQAIRPLQTWTNSVTAP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G-GIycine, H-Histidine, I=Isoleucine, 
K=L»ysine, LHLeucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y^Tyrosine, 
X—Unknown, *=Stop codon, /—possible 
nucleotide deletion, V=possib!e 
nucleotide insertion 














S 


7808 


21709 


A 


7868 


52 


187 


LDFN*RRCLTKLPRLVLNSWAQVTLPPW 
PPKLLGLQV 


7809 


21710 


A 


7869 


33 


373 


AGVQWRDLGLLQPPPPGFKRFSCLSLPS 
GWD YRHAP PHLANF VVLVEMGFHHVGQA 
GLKLP T * GDP PTLAYQSAG I TGMSHRAR 
PGISVFHLKLHS FDPFKIIFRPGAETMP 
S 


7810 


21711 


A 


7870 


100 


119 


VFVE I VFPEGLLFS PCSFHLCGFRIHLK 
AEF*FFVETGSMLPRLVSNSWAQVILPP 
WPPWL*LQA 


7811 


21712 


A 


7871 


471 


446 


LLMLPRLVLNSWAQVILPPQPPKVLGL* 
AY I 


7812 


21713 


A 


7872 


449 


33 


FSFFFFFFLGKKDLFFSPRGGGGGANSA 
FFPPPNFGKKGSAPPPPPLKSV*NPAAP 
TGVFFFFCGGGA 


7813 


21714 


A 


7873 


334 


41 


PCPSLFFFFFLRQSLTLWPRLECSGVIS 
AHCNLCLPSSWDYK*HCPYLATGAPGLL 
SQSELS YVTL * LQESRPLC I P VDFMTFL 
PSLCFQSPNSTTL 


7814 


21715 


A 


7874 


342 


716 


STGIFLFVCF*DGVSTLAQAGVQWPDLG 
SLQPSP PRLK* FS CLS LAS S WD CRHVP P 
LLANYLYF * *REGFTVLAMMVS FS * PCD 
PPTSASQSAGITGVSHCAKPGMFFLNWP 
LKSCPSESGAAV 


7815 


21716 


A 


7875 


454 


215 


PPPARCH* PP* * TLVHAVEFKSHHYSLI 
SSTQGHKHCGRPQGPLPRKTRDLCSLVY 
LLTFPSLLSYDPAKSLSARNTQE 


7816 


21717 


A 


7876 


492 


176 


P CGGRGAGGNWGWP P WHLRASAS SRWAW 
ALQPHTRSSQPAGPTGPRQ*GAQHLGQQ 
LLCSISHQALAAPPQGRAQDLQPAMPEP 
SPRPPLPWAAQ 


7817 


21718 


A 


7877 


281 


439 


TWS I DL * HMAYFLFFS FLFFAFLS FSLL 
SFSLLFFAFLFL*VPSFSLFQRQSL 


7818 


21719 


A 


7878 


507 


338 


LLRRLK* EGHLS PGGRGCS KPRLHHCI P 
HWATE*EPFSKTINK*N*S*MKMKMKIN 


7819 


21720 


A 


7879 


55 


298 


PPCLANFLKFFL*RQSLPMLSRLFSNSW 
PQAVPLPRPPKRNQLYLYFGCCSQLCNP 
QHLCSKQHTFKILFDIEKHFRLREH 


7820 


21721 


A 


7880 


145 


292 


YFGGKQIYFIFCRDRISLC*PVLLTSLV 
SNSWPEAILPSQPPKVLGLQA 


7821 


21722 


A 


7881 


3 


203 


LSLLPRLECSGTFTAH*SFKLLGSRDSP 
TSAS * IAKTTGMRHHAQLRFFFFVPGGV 
C VCVCG C V YV 


7822 


21723 


A 


7882 


324 


211 


AWWLMP VI P TL * EAKMGGS LEARS LRP V 
WAHNETPFLF 


7823 


21724 


A 


7883 


2 


124 


QHFGRLRQ VDHLRLG VH YQRGQHGE TP S 
LLK* INVYIYMT 


7824 


21725 


A 


7884 


367 


141 


KCWDYRREPPRLAQGLEFLFNFCFEIVT 
EPPCPALAYF*NFL*RWSLATLPRLVLN 
SWPQAILPRWPPKAIjGLQE 


7825 


21726 


A 


7885 ' 


471 


215 


RGMCVSHIFLCVGARYNISSARESLLKE 
FTECTERKHTHTRAHTHARARTHTHTHT 
HTHTHTRV*NSRSQPYCRVHACSPAYLG 
G 


7826 


21727 


A 


7886 


462 


204 


RHS S * LGLP KCWD YRHE P PHLAC ISSF* 
RKSDISLSDELGTKCHKWTOQWKSRAG 
SVGRDPCSGHFILAGFGQLGA 
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nucleotide 

location 

correspond 
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nucle- 
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ding to 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
0=GIycine, H=Histidine, I=IsoIeucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


7827 


21728 


A 


7887 


409 


152 


PVFLiALtERPNQQDFLRPGVLDQP*QNNK 
I S FFFFFFKKKKKYI YI CM WHAYKTP S Y 
SGG*GGRIT*AQKFKTSLNNTVRPHLYK 
K 


7828 


21729 


A 


7888 


1 


152 


GVHVCCDCKCICVCLCWGVHECLCVCV 
SK*VHMCHREGTIFGNISWEGD 


7829 


21730 


A 


7889 


101 


399 


EI INI VFFFYFFLPLLFFFFWGGRFKKQ 
LNPE FLTNS WKKHHKT I GGRNPMGG * KK 
AI TN P Q KKGRKR S F S F * NKAQ P TQGKNG 
QNSQGS KAVGALTAH 


7830 


21731 


A 


7890 


478 


142 


KFSKKKKRGVLNFYKKKAFFSKPPKPVF 
KKKKLLKNPFFKAKKQFFFKTPTRPLFR 
EKFFPGPPKKKKKRPPGLFKNN*KGPPW 
PREKKRVKKKKKKVKSHYYNNHMPISVH 


7831 


21732 


A 


7891 


497 


320 


PPWKFEFEFWFSP * LHKYYPPYVCVCVC 
VCVCVCVFDFRFFKPAPSPQSHFGINLS 
FLL 


7832 


21733 


A 


7892 


217 


23 


I P P F KKF F * EMGS HAVT * AR WQWHNHA 
SLQPETCGLKGFSHLTLPNSWDCRSVPS 
SSARRQSH 


7833 


21734 


A 


7893 


652 


470 


SSSSSSSSSSSSSSSSSNSSSSSQFERK 
TRSAPQVEGKGPNFCSLEAPPPGLSLFS 
CLNF * RS WDPGP P P * PHAT P KTRAGGDL 
LCATHHRARHCTPGLRFPLFLAL 


7834 


21735 


A 


7894 


25 


178 


G I L F F WRDGGL V I LHRL VS NF P VS S D S P 
ISASQSWDYRCDPPCPACPLIFW*LLKH 
SF*FFWRDGGLVILHRLVSNFPVSSDSP 
ISASQSWDYRCDPPCPACPLIFW 


7835 


21736 


A 


7895 


104 


451 


ILSFMTTWMEAEDIMbSEISQTQTDRYC 
MILLVESKTSTVWLRPGMMAHAYNPSTLGG 
*GERIS*GQEFETSLWIARHYFYKRNF 
KI KS KFKKI VRKSQKKNVRRWLLEARNK 
GRYG 


7836 


21737 


A 


7896 


510 


335 


SCIVSLHRSWDLWHVPPFLANF* IFRKG 
GVLPCCPGWQIYTFQLQIPFLSITPVAF 
SG 


7837 


21738 


A 


7897 


37 


260 


AGNS QNQGGGAC SEP RWRH C T PA* AT E R 
ES VS KNKQTKFLtCS VP YFES THFHCTNE 
ATGAWGVE KVALGHAASW 


7838 


21739 


A 


7898 


35 


466 


THTCGGIRKKLTNKRENQK* SQQKPSAP 
KEPEVKNKQKKTKNKPKKKKKTPQIQPT 
KKPFRJSrPNLRVFTFFVTTQNKTPTRGRK 
TFQKVKLFPPNPCLGNKPQTLLPRGGLQ 
KKKKKIPFFYSHPPPLAQKKKATPWQNG 
LGFT 


7839 


21740 


A 


7899 


494 


149 


PARLLSLHSYEVKSVLPKEEGLLNFFVQ 
S VTAHPTSCIGLEEI *LLD 


7840 


21741 


A 


7900 


494 


114 


GAEGG PliS KKKKKNIiLPDYHTGPMS LS L 
YLWPFLSNGHTLTLMCAHTHTHTHTHTN 
H * L S RVGL P VL KP G Q YQ ANGNR W S P * YN 
IKE* PRKTRVMPVTPQLRNDGPMLTFWC 
S S RNS FAMHVLGKL 


7841 


21742 


A 


7901 


19 


454 


VWCNSSVS IHTHTHTHTHTHTHTHTRSW 
VGWHMP I VLSTQEAEAGRLLGSGSLRIQ 
RAMIVPLHSSLGNAARPCV*HTGSCCVS 
QAGVQWHNHSSLYPQTPGPKQSSSLSLL 
SR*DYRHVPPHPASCVCVRVCVCVCVCV 
CVYGDRGITPH 
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Amino acid sequence (A=Alanine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosine, 
X-Unknown, *-Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


7842 


21743 


A 


7902 


3 


393 


DAWDPLNPL VNLFVS PKRJNTS S LDTRKKP 
CRESKKFNTHSRPKSSHQLRKRSSSTPT 
T * KI PNI * LNS S TP SRAKKKRKKKP KKK 
KK* KNSQKKKKPMRKTKKKKGGGLLKKI 
LGGAKFFGGRKEKIFFF 


7843 


21744 


A 


7903 


424 


139 


LFFFFFFFF *G* GFVI FCYVGQVGL * LL 
TSGDLHI SASQSAE IRGVSHHAPPLSL I 
WFISQCVGILNHTVHLQYIQCSFVHHF 
LINQEKIDIIK 


7844 


21745 


A 


7904 


1 


181 


SQ*LY*LLAANLFILT*IGGQPGSYPFT 
IIGQGASGLYLTTILILIPTICLIESKI 
IjKWA 


7845 


21746 


A 


7905 


3 


281 


HA YAHAS AT F FVS T CFHGLHD I MGSAFL 
TMCF I RQL I FHFTS KHHFGFQAAA* YWH 
FVD W* LFL YVS I Y* * GAVLKE PWGGQS 


7846 


21747 


A 


7906 


1 


84 


PTRPLTMLPRLVSTPGLKQSSQLSLPKC 
*DYRCEPACTAYSF*LTMLPRLVSTPGIi 
KQSSQhShPKC 


7847 


21748 


" A 


7907 


3 


392 


DPDNYTLANPL I TLPHIKPE* YFLFAYT 
I LRS VPNKLGGRLALLLS I L I LAI I P I L 
HI S KQQS I I FRPLS QSLY * LLAADLLFL 
T*IGGGPRSYPFHjIGQVAFVL,YFTTIL 
ILIPTISLIGDRILKWA 


7848 


21749 


A 


7908 


2 


231 


GCVEKGTLSHC*WECKLVQPLWKTVWGF 
LKELKIDIiPFDPAIPLLGIYPEENKFTL 
GPLHSVGRSSRKLPGYSPRV 


7849 


21750 


A 


7909 


529 


297 


KIFIGAPVFCPPPHFFFLIPPF*GVEKI 
FFSFFVLLPFGFFFLFLGLFFFFFKLFF 
FFFFFFFKIPFFFS KKVGVGK 


7850 


21751 


A 


7910 


410 


61 


FNNQAMQWFPPFPLKIFFFP *RV* FLE 
GGWSNLS PPPK*GPS PKFPTRVL *GPP * 
GKN*NFGFPG*NWAPHRFF*RAAPFFFF 
C*DRVLLPRLVSNS *AQVIHPPWPPKVL 
GLQA 


7851 


21752 


A 


7911 


512 


314 


ANFC * LF I ETGFPH I GQAGLELLTSNNP 
PAIAS Q I AGTTGVNLRPGQD F S S YHSEN 
PLVYGSLYIA 


7852 


21753 


A 


7912 


1 


264 


SSTTTLLLSRT*NKLT*LTPLIPSTLLS 
LRGL P PLTGFL P KWAKKKKKKKKKKKKK 
KRGGGGWKKKPKKGGGGETSSKKDTFFK 
KGGP 


7853 


21754 


A 


7913 


416 


129 


FFFFFFFAPPPKIKTPPFFFKREVYIY* 
ALS VFQS VGKAL Y IHFRI I S SQ YP C * LG 
FFFFFFFFFFFFFLKGSLDLLSAYCVPG 
SVQGALHTLSH 


7854 


21755 


A 


7914 


397 


34 


IFSNLFFPPQKKMFFPPPPLKYFFFSFT 
ALFFFLVFFPFFPPPKKIFFFNNPPKFF 
FYPS *KKKFFFFIPVYFLAPPEFFF*AP 
PPFFFFFFFFFFFFFFFFFFFFFFFFFF 
FSVSKQPLFCLKI 


7855 


21756 


A 


7915 


396 


86 


I FLS PRK*GFFS P PP P * KFFFS PKPLI F 
FGGFFPKFPPPKKNFFFQNPPGVFFYPP 
FKKKKFFFPPPLNFGPPRVFF*RPPPFF 
FFFFFFFFFFFFFFYLWVC 


7856 


21757 


A 


7916 


351 


112 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFPI*KYVFIYISLSLIPFCKFLK 
GRECDIHFCDIQHSYSKYLGHFKCSTND 
RQMNK 
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Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny!aIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7857 


21758 


A 


7917 


445 - 


2 


FFFFFFFFFFFFFFLSYF*LIHYAEGIG 
AS PC Y I MLG YNFS S FP CGT I S I APGF1STF 
YRLYFISHASADAWVDPRFLDRYRRGIN 


7858 


21759 


A 


7918 


2 


407 


P P P P Q VYFNT WRT I F FGEGLLH I F P P Q K 
RFC F YKT YTN Y I KTL F I KKKN I FS LAH I 
KMVPPRS I Y*TPPPLFFFFFFFFLFFFF 
FFFFFFFFFFFFFFFFFL 


7859 


21760 


A 


7919 


107 


11 


KMW Y I YTLE C YS ALKKKKE ILL FD TTL I 
NLKDIMLSKISQHRKHTHT*FKT 


7860 


21761 


A 


7920 


1 


70 


FFFF*AAESCSRIKCSGTTIAQCNPELP 
GLSDPPTSAT*VAGTKGVCHHTQLRFIF 
I V * L S GR I L LQDQ VQ WHNHS S V 


7861 


21762 


A 


7921 


123 


260 


G I QARKE PGAVAHAYKLSTLGG * GGW I T 
*GQELETSLVNMVKPCL 


7862 


21763 


A 


7922 


184 


399 


LHLLMRVLKKKKKMGGPLYRS PRGAKVN 
PALQRLDSLLIGSRILSNLGIFGKAPHS 
WGG*AFFAATGLHGLHVIIGSTFLTIGF 
I RQL I FH F T S KHHFGF KADD * Y WH F VD V 
A* LFLYDS I Y* * GSRI LSNLGI FGKAPH 
SWGGTKWDNPPPDMKSS 


7863 


21764 


A 


7923 


2 


407 


GRVGMI * EAFAS KRKVL I VE EPS INLE * 
LYGCPPHPLHTSNKKPPHTKKKQKKKKK 
KKKKKKKKKKKKKKKGGGLLKKLWGGP K 
NTGGKKKNFFFFWGGKKKNLGDFLKKNL 
FWGGGKIGPTPPQKKNPLGEKKF 


7864 


21765 


A 


7924 


1 


398 


PTRPPTRPTSSRSRAPPFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFVSAALFFGKKKNFFFFF* FFRG 
* GFYFFP PLYFF I CGGGGVFFFFFFAGW 
NT FFI PPKKNGI FKTLRGKGG 


7865 


21766 


A 


7925 


2 


335 


NLTFFPQHFLGLSGMPRRYSDYPDAYTT 
*NI LS S VGS F I SLTAG I LI I FM I * EAFA 
SKRKVLIVEEPSINLE*LYGCPPPYHTF 
EE PS I KKKKKKKKKKKKKKKKKKKKKG 


7866 


21767 


A 


7926 


1 


328 


RTRGRTRGESNPSLRIQNSPCHLSHPIL 
KKKKKKKKKKKKKKKKKKKGGPLKKNPG 
GAKI * PGPKKKNLFFKGGLKKTPLGNFE 
KKPFFGGGKKRKKP PQKKKTLKE KKKF 


7867 


21768 


A 


7927 


27 


226 


LSR* KM * NKSHIYS FE KKKKKKKKKKKK 
KKKKKKKKKKKKKKQ KKENR P KNGD S EN 
GHPKSKIFWL 


7868 


21769 


A 


7928 


398 


2 


GGGGGPGGRGRP* *NPHKKEGRDPPHPK 
KKFF PRRRKNRGGGGRE KS P PQKKKAPQ 
KKPGRGFKRAPQKKKKTPPPPRKKGPPP 
KNFKKGAPPFFFFFFFFFFFFFFFFFFF 
FFLGILARTTTTTNDMKNHR 


7869 


21770 


A 


7929 


3 


156 


HASAHASAPVFV*SVLMTAVLLLLSLPV 
LAAGVT I LLADR I LNTTFFDPA 


7870 


21771 


A 


7930 


405 


249 


ASAHAS APLFV* S VL I TAVLLVLSLPVL 
AAGITILLTDRNLNTTFFDPA 


7871 


21772 


A 


7931 


921 . 


699 


DHPGQHSEILSLQKMKTMSQCGGAHL*S 
QVLRRLRREDCLSPGSQSCSEL*SHHCT 
LACETE * DLVFQKKKKF 


7872 


21773 


A 


7932 


423 


259 


HDLGSLQPLPPGFR* SS YLCLPSSWDYR 
CEPLRPAQRRGILMAATSRILTRSKGW 


7873 


21774 


A 


7933 


395 


35 


PRVKRFS CFWLPSKWGPRPRAPWPGYFC 
FFC*NGGSPFLPGGSWTLAPNYSPPPAP 



1033 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GIycine, BNHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S==Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y—Tyrosine, 
X^Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














QNGLVQALRP PGP PTFPPL I FPTKAPWS 
W P F FGGKKKKKL S P KGH S E RHLHHKY I R 
LLSQHGGM 


7874 


21775 


A 


7934 


1 


118 


NF*IFCRDWGIALSPRQLSNSWPQVILP 
PQPPKVLGLQA 


7875 


21776 


A 


7935 


322 


383 


KKKTGLIKISTLCFVKNTFKKTKWQAPE 
RKKKVPIFLSYKRPVFKIYKKLFPFCRK 
KIKKKIPPPGFTSFSCLSLPSSWDYRHP 
PPCPVNFFCIYNKNGVSPG*PG 


7876 


21777 


A 


7936 


3 


131 


GFHRVGQVGL * LLTLGD P PAS AS QSAG I 
AGLSHRSQPLTWSIi 


7877 


21778 


A 


7937 


334 


146 


HPQGLNI *NKMVS S SGAKHS SRLSLP KW 
WDCRQEPPCPVIMFLKRGINHAIiYSPSR 
KLLHFL 


7878 


21779 


A 


7938 


2 


371 


SEPGAGYNLLVGRF * SPSEKCS IGVGVT 
RFSRCCLSPLSLTGKGNSLTPCASRVRQ 
CLALLWLVQGVLHPLSCTHCLALPSEMN 
P VPQMEMQKS PIFC I ADAGS CRPELFLF 
GHLGSSPQSKL 


7879 


21780 


A 


7939 


2 


95 


RLNLGGGGCCEP * S CHCTPAWVTKRNS I 
SKK 


7880 


21781 


A 


7940 


1 


191 


PTRPSQTPGLK* FSHLSLPKSWDYRCEP 
PCLAVILS IPQNSSMPSNIFAAS YSS * F 
GLYSSRW 


7881 


21782 


A 


7941 


127 


383 


SKDCVRIVLLRAQAKAGSYRTVNWCRPG 
AVAHTCNHSTLGSRGGRIA* GHE FKTSL 
GNIVILPLYKKNKKKKKRGARLKEPNLT 
P 


7882 


21783 


A 


7942 


235 


3 


KNILPGYFCFFLREKPPPPGLESLFSKK 
KKKKKIARHGVAPIiLGRMR* EDRLRPGV 
QGCNE LWW CHCT PAWATEQN P 


7883 


21784 


A 


7943 


3 


225 


EHGSLYPPTPGLKQSSYLSLLSR*DYR* 
AAPCPANFFIFNFL*REDLTMLLRLVSN 
S WTQAVLLP PKVhGLIi V 


7884 


21785 


A 


7944 


2 


232 


TLLPDNLSQTIYPNKKKKKKKKKKKKKK 
KRGGPFKKKKF*TRGGGKKNFFFRAPKI 
FFGGRVLKKGGGKKPGEPTN 


7885 


21786 


A 


7945 


394 


41 


WCRWLETWGAGGSGAVPPPPFFPPHSSS 
WDLALQGSGASSPFSPGAVAHACNPSTL 
GGQGGWIT*VQEFETSLANMMKPHLFSK 
KGPVTAVAHPAKAHRAMWCTPTHMHTQH 
VLHVL 


7886 


21787 


A 


7946 


405 


100 


FFFSPRFSSSLFLFLSSFFFFFFFFFFF 
FFFFFSTGLFVYWSPTH*SLPVPRIFFF 
SYLWQTEVETMLGTE 


7887 


21788 


A 


7947 


391 


212 


KEMQIMPAMR* HLTPVRMVITKKS KAKC 
WQGCEEKGTLAHSWWKCKGTAETGKLAG 
F 


7888 


21789 


A 


7948 


106 


301 


LQL P I T QL VWHS I Y FMKKI RYRLG WAH 
TYNPSTLGGQGGRIA*GQVFKTSLSNTA 
RQKKKKKKG 


7889 


21790 


A 


7949 


342 


208 


GLGAGTHT YNP STLDN* GGWI TSGQE FD 
TSLANMVKHHWQTGQF 


7890 


21791 


A 


7950 


391 


21 


RQGFPMLPRL IS3STS * PKGSAHLSLPKCW 
EYRSEPPSLATWYVTLSRHFCSTLLLGQ 
GRS KFHMKWES RliEGIiLAKVS S LRS S YC 
SSLRILGLEILLSPLLFFLETGSGSVSQ 
AG VQ PL C P GL K 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyiaIanine, 
G~Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *— Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 


7891 


21792 


A 


7951 


146 


355 


EN I RQTQNE D W I KSSNETVFCKNI IIS 
LGWAHACNP S PL * GQGRRI T * AME F * T 
SLDNIARPCLYQK 


7892 


21793 


A 


7952 


289 


327 


KGIPPLPPRLKDKGPFFFFFFLKRESIiF 
IjP*FWKNGCRGPPIiFFFFFLRDSVSIiLP 
RLVSNSWP*VILLPQPPKVLRL»QA 


7893 


21794 


A 


7953 


861 


586 


DR VRS VAQARLQ WHDLGS LQP PPPGF * Q 
FSCLSLLSSWDYRRMHTWPFVF*WRWGF 
TC VAQAGLDLLS SDNP PALASQS ARI IG 
ESCCVLLK 


7894 


21795 


A 


7954 


402 


52 


PRKVS IS* PHD L PAS AS QNAG I TGVSHR 
TRPSFNFSYIETAITETRKSLQPSPGMN 
* SL P S GMVGFKPHS F IHS ANF CEAIjAVQ 
QTLGEVPGHAPLSPPRQEPRVCPKKVCL 
KTALK 


7895 


21796 


A 


7955 


259 


3 


YNTKVLS FP I LL*TFLKHLWLGAVARAF 
NPSTLGGGGGRIA*GQEFKTSLGNIERC 
RLYKNLKLFLESGLFLMAWRGAKRVRTR 
G 


7896 


21797 


A 


7956 


171 


386 


LPIRLFSLVFTTIEKSRPGMVAHTCNLS 
TLEG*SGWIT*AQEFETSLGNMAKLSLP 
KIQESASLITRSQNK 


7897 


21798 


A 


7957 


397 


283 


PQPLPPGLKQFSCIjSLPKC*DYRRDSAR 
PAKKNPTFNK 


7898 


21799 


A 


7958 


169 


324 


gthrvemcsqwlgwahagdps plggqv 
rkit*aqefgtslgnigrpclyik 


7899 


21800 


A 


7959 


290 


397 


DCS SCCS PLKGR*RG* AQWL.TP VI PALW 
EAEAGGS 


7900 


21801 


A 


7960 


496 


298 


lsktgslyvaqswywlftgtimvhdsii 
kllcsssppvsas * vagttgahrccevl 
ttfqpdvyrl 


7901 


21802 


A 


7961 


404 


174 


PGP QKGS F F * KKKKKKKR I DH I LSHKAN 
LNKC*RIKII*TMLSDQNEIKLEIIIKR 
QLLKETLFLEMKKYTTRKLS 


7902 


21803 


A 


7962 


25 


99 


PGLFLSS *HQVILIiPWPPKVLGLQL 


7903 


21804 


A 


7963 


436 


106 


GGG KS P P P GNWAP P GNPGE P L F F * KKKK 
KMPGGGGGRFFP P P PGGRKKKGVYPGGG 
GFYLTNFPPPPPPWGKKKNFFFKKKKKK 
KKKKGRIAICYIILFITSINFKKAFI 


7904 


21805 


A 


7964 


212 


395 


ERERKERDRDGEREREREEREEGRKKHG 
GR*GGREGERHRPRGAEDPDRAGRATDK 
RTGAL 


7905 


21806 


A 


7965 


344 


345 


FSDCYKRSSCASWTWYTRLAHHQPPARP 
GAAPS PRKRWVDFAVNRNFPSRRHPCAT 
PRPAPGATGRPSRVPKSTAFTQMLTQPR 
DPHLPLPIPPLSQPPPSP*PAPIiVMR 


7906 


21807 


A 


7966 


397 


298 


RGFTMLPRLVLNSRTQVI*LPWPPTVLG 
SLSIF 


7907 


21808 


A 


7967 


92 


268 


IKMSLCPNFFFFFKKQNVGPRAGAYPCN 
PTILGGQGRKIT*GQDLKTTLVNMGKPH 
FY 


7908 


21809 


A 


7968 


257 


107 


RTHTHTHTHTHTHTHTHTHTHTCENSKT 
TKNIANNCS * LESQRNNGKCTYAKF 


7909 


21810 


A 


7969 


413 


12 


PPPPPLIiTPWPCFFPQIL*PVGHSAPPL 
FK*LKI PRPFFFPPPPKGIGAPPPKIFF 
PPGFSLPPFVKLPPGEKILFPPPKKKKY 
PPPPPQS*SF*PPPPPPFFFFFFSTSAK 
KS KKNFLF I FS PFFNPKYYNT 
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Amino acid sequence (A=AIanine 
C=Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=OPhenylalanine, 
G^Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tnreonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Llnknown, Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7910 


21811 


A 


7970 


399 


212 


TGVQWSNHSSLQSQTPGLKRLFLLIIjLS 
I WD YG CMP P Yh S FKN * FCTKGGS CYVAE 
ADVNIL 


791 1 


21812 


A 


7971 


584 


424 


RPRRENRLNPGGGGCSEPRSRYCTPAWV 
TE *NS VSNRNKTNQNP I FFSCKQFD 


7912 


21813 


A 


7972 


105 


1 


P S M P Q L S HL * NG I LGRVRWLT P V I P AL W 
EAETGRS 


7913 


21814 


A 


7973 


363 


28 


SS S LGGHPQLW FHTLKLRP VT VS YA* NK 
LCDKI E KKLFFT IALHNKFLRKKLTRDV 
KILFNNK*IKKEGPNEF*NIIjCLiWIERI 
NINKVPYPSKvMYRSNTIjPIEVAVFKSQ 


7914 


21815 


A 


7974 


404 


250 


FFFFW*R*GFTMLPRLVLNSWTQAIHPP 
ALAS QS AG I TGMS HHAQPLS I I F 


7915 


21816 


A 


7975 


2 


60 


FSCLGLPF CWD YRHAP PHL ANF CRD GVL, 
NY * PQ V IHLPQLPKVLGL* ACTTTPG 


7916 


21817 


A 


7976 


325 


81 


KTIHSLLFGQFFLLQPLLPSPIPHAAPP 
TPHL I FLLLLFCF * FFK* RQGKQGLTTL 
PKLISNSYSQVTLLPQPPKVLGFQA 


7917 


21818 


A 


7977 


406 


282 


QWRDLGSLQPPTPWFK*FSSLSLPNSWE 
TKAGRSREPGGGD 


7918 


21819 


A 


7978 


382 


221 


DC I I S AS YLQKNF FFFLGDKVLLCHPGW 
S AVA * S * LT VTS KR VHF YS S E I S L Y 


7919 


21820 


A 


7979 


423 


29 


FLW*RRGFTSLPRLjVSNFWAQGICLPWL 
PKVLGLQA 


7920 


21821 


A 


7980 


3 


238 


SLAFF VETGSHFVA*AGLELLS SSNS PA 
IiASQSAEITGVSHHTQPE*GYSHM*PQY 
PYQLRFNSDTI ILSNVLFVPNF 


7921 


21822 


A 


7981 


324 


128 


WLSVSPYTFELSLLVGYEFVIHDVI*TL 
I FWVVVVSLFCFLWFFLVCMVFWFFC 
CCFLSVFCI 


7922 


21823 


A 


7982 


1 


86 


NLTMLPRLVSNSWAQAICLSWPPKVLGL 
*AQAICLSWPPKVLGL 


7923 


21824 


A 


7983 


1 


137 


RPANF C I FF VE TE F CHVAQAGLE L.LG S G 
DPPTSASQGAGTTGMSH* D * PANFCIFF 
VETEFCHVAQAGLELLGSGDPPTSASQG 
AGTTGMSH 


7924 


21825 


A 


7984 


2 


277 


PRVRS VQR V I YQ Y V * NLQ I HVP F DS TNL 
G I YLTD I L PHVWNDKTMYKVFAAVS F V I A 
KY*NPCLTRRQCNKLWPIHAMEYYVTIK 
KNEDGRA 


7925 


21826 


A 


7985 


211 


42 


GPQKWPRGPGENLGGFFFFFFFCRDDGL 
TMLPRIiALNS*VHAILPPQPRKVPGLWV 


7926 


21827 


A 


7986 


409 


183 


LLRLK*SSHLSLPKWWDYRCEPPCPAIF 
FKRKQILKESFPGENTCCHLFNSLEVLK 
NKRRTCYNFPNTLAVIiKNM 


7927 


21828 


A 


7987 


2 


194 


RFFETQSRLLQPPPPQDK* SSHIuSLQSS 
wDYRHVPPCIiANFFL FW YFLQGQGFAFL 
LKLKFLFK 


7928 


21829 


A 


7988 


483 


209 


SSPSLGNFCIPSSGGVSPCWPGWFQTPD 
LVIWPPGPPSLVFFFFFFRDEGLSMLLR 

T .TT ,"NTF * A O V T T >T . POT t P F 1 * T .(TT .n A f)P n\ TT> 
LYPAEVF 


7929 


21830 


A 


7989 


164 


29 


KFWFWLGTVAHACNPSILGSQGRRIA*A 
QELEPSLGNEGRPCLYK 


7930 


21831 


A 


7990 


3 


474 


PTRAPLRAPFETIVQSEEFSKHHFGFEA 
AA*YWHFVDW*LFLYVSIY**GSPPPP 
PKILIKKNGAETKKFFF 


7931 


21832 


A 


7991 


469 


323 


VETGLHHVAQDGLELPTSGDSPASASQN 
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Amino acid sequence (A=Alanine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F~ Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K—Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IGITGVSHHA*PDFTFLEQF 


7932 


21833 


A 


7992 


469 


13 


KGDTVS SSDSSPS CGGLFPVGAS * LL CL 
FTQALAMVGARRPFSMLPCSLISDCCAS 
NERGSVGMGPSKPGVGYSLLVCRWLRPL 
EKRTIKVGVTRFSRCRLSQLPLARKGNS 
LTPCVFWVRHCFGSHSVGCTHCPAPAVR 
QAPVK 


7933 


21834 


A 


7993 


15 


439 


I AS GR P FS I KF FL VAI T FLLFDLE I ALL 
LPLP*ALQTTNLPLIVMSSLLLIIILAL 
SLA YE * LQKGLD * TEPNKKKKNKKKKQE 

GGKNMFFFFWGGEKKTPGGFLGENLFLG 
GGK 


7934 


21835 


A 


7994 


392 


186 


YFYI INNFFFFKKIFI* * 1 1 FFFLKKFL 
*YSPKKVFFLFF* IFFFFFPSSSLFLFF 
SSSSPFFFFFFFFFFFFFFFFFLRG 


7935 


21836 


A 


7995 


11 


386 


TWE VEVAVS *DHAI ALQPGQQE *NAI S K 
KKMTGRMF IE VS F 1 I ASS WKQLKCS * TG 
E*MKSLWYTHI IEYYLTIKENKVLEHEV 
S RLTL I DTGRERS KTKE YTL YNP I YTNL 
RIAKLSGCGGSHL 


7936 


21837 


A 


7996 


3 


186 


DSFCF*RQSCSITQAGVQWGDRGSL*PQ 
S PGL KRSSCFSLP KHWDHRDE P LHPAG V 
AVLN 


7937 


21838 


A 


7997 


316 


649 


RDHSFFFEMESRSVAQAGVQWHDLGSLQ 
ALSSGFMPFSCLSLPSSWDYRHPPPCPA 
NFL YFLVEMGFQHVGQNGLDLLTS *STR 
LGLPKWLGI TGVSHWHPAQDGGFHHVG 


7938 


21839 


A 


7998 


468 


336 


RRGFTMLARMVS IS* PSDMPASASQSAG 
TTGVNHHIRIHRSLL 


7939 


21840 


A 


7999 


464 


283 


TL CGG I RP TNI MKGNLL YS KS I GLNVS L 
LPKNPRRNIQNNV* PNTWALQPSHVDI 


7940 


21841 


A 


8000 


46 


487 


RQGRLSLQKFLLPFVQLCPAHRGGVYRG 
RQASLSCGGLHPVRVSQPLCLPT*VSAM 
AAAPHPASLLPCSLISDCCASSEQGSVG 
VGP S KPGAGYNL L VCHLLRLL E KRS I RV 
GVSQFSRCHLSWLPLARKGNSRTPCTFR 
VRRCLAL 


7941 


21842 


A 


8001 


295 


1 


TQPWGTKRKDPLKKKKKNSGDREWRWLC 
NS VT I LNATTM YTSEWLRPGTVTHTCNP 
KTSGG * GGE I A* TQE FKAS LGNI A* SWL 
YKKILKISPLWWQT 


7942 


21843 


A 


8002 


144 


317 


ELFYLKEMV*RPGPVAYACFPSTLGDRG 
GR I T * AQE FRTDLGN I ARP C VYLRKKKK 
RG 


7943 


21844 


A 


8003 


69 


291 


HGVWTMI FI VIKYFYLNPQNNSMRYYYS 
HFMDKETDVQSD * MTC*RLGAVAHACNP 
STLGGRIA*GQEFKTSL 


7944 


21845 


A 


8004 


3 


176 


IVAHSSLTPGLKQFSCFSLPSS*VYRNM 
LPRLANY*FFGTNRVSLSCQGWS*TLAS 
LLGS S S S PAS ACQ VAKS TGTC YHAWLI I 
DFLVQTGSRCLAKAGHELCS CL 


7945 


21846 


A 


8005 


419 


178 


NPPPKKLFLSSSSSSPLPHLNWGTPGFF 
P P PP F * NP P PE FNFGAPKKKKKL S P PRA 
EKLVPFKGPPPFFFFFFF 


7946 


21847 


A 


8006 


429 


225 


QPLPPRFK* FFCLSLPRR*GHRHRPPHP 
FNFF F F * YFLR WS FVLVAQAGVQ CMDLS 
IYLNEFWLLDF 
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ding to 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cyste\ne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=JLysine, Lr=Leucine, M~ Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7947 


21848 


A 


8007 


756 


224 


PPRHQPQHVPPPLLHFSTPSPAPPKSHA 
AAPNSNPPQAHAPPAPPAPRAHSPSPPP 
RGAT P P FPP P P FS FLAAVGRAP I FPRE P 
LLPSSSSPPPPP*IGEPPGFSPPPLLKT 
RP P K* N * GP PKKKNFFP P PRPKNWFL * K 
GPPPFFFFPFFFLRQSFTLVAQAGVHKT 
RLHLKKKSN 


7948 


21849 


A 


8008 


170 


559 


SSHLSPQSSWDYRSGMVAHTCNPSMLGG 
QDGRLT*AQKFQTSLGNIVRCHFYFKKY 
FKNKIFFYSPFINTNPKKKKKKGGAVLK 
NQSLRPRAGKVLVFLWGHLIQFRGPRFK 
NGGTGKPGGLPNLLAPR 


7949 


21850 


A 


8009 


1 


662 


RTRRTRSWWYMPVLPDSHYGEAGESLGP 
* KWMVD * API APLHSTLGDKMTLPD I KK 
KNGETVANTS PLSTTAKPAKLFLKAKKD 
QLMRDLFLPKTPKKPQMS TGLDARSKRW 
LKIIWRRHGIWPLKNIGPTEDVQASAHG 
GVEENMTSDIEIPEAKHDHRPTEDVQVS 
AHGGVEENITSDIEXSEAKHDHHLVEDIi 
SESLSVCLEDFMTIGSQWKPICLS 


7950 


21851 


A 


8010 


377 


2 


NFFFFFPPFWGFFFFLGPGFFKVFF* I F 
PFFFPRNPPFF * FFLGVPLFFKK* I FFF 
PFSIRFCFFPKKFFFFLILFFFF*FPFF 
FFFVYVFDFFFSFILYIFFFVYFSLPFT 
VLIISXiIYHFAPA 


7951 


21852 


A 


8011 


1 


363 


PTRPLF* * RGCFTMLPRL I LNS WPQVI C 
PPRPPKVLGLQT 


7952 


21853 


A 


8012 


278 


39 


FXXXFFLSPPXFLIFFFFFFFFFFNFFF 
FFFFPPFFFFFFFFFFFFFFFFFFFFFF 
FH*DQ*FSNFDFTPFTFTLISNII 


7953 


21854 


A 


8013 


79 


412 


MMECILGKGGKDGGFFEQCENLGLC*FIi 
KVP * D S D I WLGAVAHAYNP S TLE GQGER 
ST*AQEFKTRLGNMTRPCLYQKHTHKKK 
T KKKP H P GLGWGAWG P T YLGGVGT K I T 


7954 


21855 


A 


8014 


277 


19 


FGNIHNFFLIGSRSVTQAGGQWCNHSSL 
*PQTPVLKQSSHLSLPSSWEIEAAVSQV 
R I TALQPGRQS KTL PGNSGP VLRADARR 
SI 


7955 


21856 


A 


8015 


1 


130 


ARIVSIS*PCDIiPASASQSAGITGISHH 
TQLAHNTLKAS PTLF 


7956 


21857 


A 


8016 


4 


417 


RLMPPHIANFLFF* RHGFTYVAQVGLKL 
LGSNDPSASTSQSTGITDVSHCTQPHLL 
KSSHSTFPLKTLDSMATNCSWDKVHYLP 
TACRALLPCPLPSACFPSLSQVPSVFTC 
C VPATLTI TliLPLIiRAFEHVTLCPIiF 


7957 


21858 


A 


8017 


388 


158 


CVTWASNCINWIGFPLS + INSHLCMEII 
VLI*GEIYNVCISIYLCIYMCVCIYISV 
CVCVC I YVH* TP YL IHKCL I 


7958 


21859 


A 


8018 


154 


382 


G I KRPGAVAHNCNP STLGG* GKW IG* AQ 
E FKTS PGNMVKPQ I YIiKKKKKKGGAVLK 
DPSLRPRAGKV I V FF WGPL I 


7959 


21860 


A 


8019 


396 


237 


FFFFFFW*G*GFTMLPRLVSNSWAQVIH 
LPWPPKALGLQVRTMVPGLRLANF 


7960 


21861 


A 


8020 


412 


168 


GNL,LGSPLPGWCKNKPFILF*KKK*SL 
AVLPKLVSNFWP*MIL J PPWPPKI*GVQA 
RAPPPSQILKKKKSSHFYNSHVNSL 


7961 


21862 


A 


8021 


1 


124 


TRFLHVS QAGL * LLTSGDLPAS AS QS AD 
YIFIISPAKIIFL 
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sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *— Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7962 


21863 


A 


8022 


3 


200 


HLSP YFYFFKDRVALCHPGWSAWQ* P I 
VTS I FWSS C* KLTS WRRWRP SWLTR*N 
PVSTKNTKN 


7963 


21864 


A 


8023 


3 


132 


FLFFFQF* FMRLGLAMLPMLGMNSWA* V 
ILLPQPPEWLGLQA 


7964 


21865 


A 


8024 


3 


187 


VGQAGLEPPTSGDLPASASQSAEITRMS 
HRAQTK*DFIGSFSEEFS 


7965 


21866 


A 


8025 


405 


231 


SDKWIKKMRYIHTTEYY+ALKRKEILTH 
ATTWLNLENILLSEISQSPKEiRYYVIPL 
T 


7966 


21867 


A 


8026 


175 


382 


GKRI FPPYPPQEGGKTGPPQTPGLFFFF 
LKKK*PGGLGGARFSPLLGGVGRKNSFT 
LKGGGAINLKGPPSLPPGGKRWAPPQKK 
KKKGQAWWLMPVI PALWEAEAGRS 


7967 


21868 


A 


8027 


83 


374 


GDQVWWLMHACNPSTLGGQCGRITWDRE 
FETS LANI F FP 1 S TKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKAS S GGGVS * K 
KNFSPRAGKEIFF 


7968 


21869 


A 


8028 


382 


148 


SPHCSLDLVGLKQSSHLSLPST*DHRCV 
PPRIAIFSFAFLFFWEATTTTTKATMLP 
RLVLNS * AQAI LLPQPLKVLRL 


7969 


21870 


A 


8029 


88 


208 


SGS V*ENNPLS WAWWLQS I IPATQEAE V 
GRLLEPRSSRSA 


7970 


21871 


A 


8030 


1 


283 


NKSRI KKAI KMTEKMKEKIEMMKLMNQL, 
PNGEHHAM I QMMMKI EMKGRMKI KE * DK 
ESDKDDRKDEREDRDDETDEPTPERRTS 
RDDSDDDEDRDEGKNEDKRKDDSKDDDE 
ADEDTDQDDYDPMDAHKAADEDDDEDED 
EVAEQMTKKKKKKKKKKKKKK* *KDDRK 
DEREDRDDETDEPTPERRTSRDDSDDDE 
DRDEGKNEDKRKDDSKDDDEADEDTDQD 
DYDPMDAHKAADEDDDEDEDEVAE 


7971 


21872 


A 


8031 


2 


139 


LQTLLPLIAF* ^LLASLANLALPPTINL 
LGELSAQFAVIGMYILY 


7972 


21873 


A 


8032 


80 


324 


YLHFYFFVR* KLMGLDTVAHT YNPSTLG 
DPERRIA*A*EFKTSLGNMVKPCIYKKK 
KKKKKKKKKKKKKKKKKKKKKKKKK 


7973 


21874 


A 


8033 


299 


158 


GNQPGASAHTCNPSTLVGQGGRII *AQE 
FKASLGNMVKHYLYQKHKN 


7974 


21875 


A 


8034 


416 


60 


AQWLTS 1 1 P PL * EAE VGGFFEPRSLS PA 
WATE 


7975 


21876 


A 


8035 


398 


271 


FYFL*DEVSLLPRLVSNSWTPAILPPWP 
P KVP GLRHE P PR P A 


7976 


21877 


A 


8036 


385 


151 


FLYFL*RLGLTLLPQLVSNISSSDPPT* 
ASESAGITGVSNHAHLRICILTRLPGNS 
SH*NVRRAGLGDTQSSALIPPG 


7977 


21878 


A 


8037 


3 


406 


PASAS * VAG I TGTHHHAQL I FVFLVETG 
FLFFFFWAVFLFFKTKNGPPGGQLFFFA 
APPALGGGDFGGQGATFSPKGLGVLGET 
RGGAPAP KKLGAKKE PSHLLGGGAQNL P 
KPRGQ KG FGF Y F Y F LARD F FLG 


7978 


21879 


A 


8038 


428 


20 


LANQ*WKPPSNPWETKISPKKKKGPPGQ 
GPPPLTPPLWEAPKARKLGSPWLPR*NP 
LFPQKPKKFFGFGGGPPYSPLPQKLSPK 
NGVTPEVGPSHKPKFPPSPPPLGQK*TP 
FPQKKKPKRLKRGMFLHGSTFQSL 


7979 


21880 


A 


8039 


3 


194 


QNKI S Q VWW YT P V I LAT WEAEGGE S VE P 
GRQRLW*AKTAPLHYSQDEKYLKASVDS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, K =Stop codon, /=possibie 
nucleotide deletion, V=possibIe 
nucleotide insertion 














YCLISCH 


7980 


21881 


A 


8040 


415 


96 


LE PL * KYFP ILDAI KNICDS WEE VKI S T 
LTGV* KKL I PMLMDNFEAFKTS MEKVTA 
DWE IGELELEVEPADVTELLQS CAKLE 
GMRS CFLWMSKESGFLRW IYSW 


7981 


21882 


A 


8041 


413 


254 


PGFKQFS CLGFPKR * D YRHKP P CPASKY 
FKINYLRNIYRSLILKN 


7982 


21883 


A 


8042 


409 


102 


GFFPFGPFFF*PPPPGVFPPPPPPNFWF 
*GFGPPPPPLF*FFFRGGPKKIWFFPFF 
LWPPLFFFFFPFWPKKTPPKKARSLKIF 
FFLGPKKIPNPFFFFFFF 


7983 


21884 


A 


8043 


414 


153 


GRGWATNFYFFL I ILK* FFF I FKKFFFF 
HLFFFFFFFFFFFFFFFFFFFFFF YKI F 
LFTLVLTCPQTS CREAEP FDHKVCS VLK 
ML 


7984 


21885 


A 


8044 


401 


292 


NYLPPQ*KKKYYPQGPFRKPIKKSPPPC 
FFFFFFFFPSFPPFLNIPPPQKGKFPPK 
KYFFNPPLFPPLFFLKPPPLFFF*GPKK 
KNLIFHPPSKKICPFKRGPP*FFFFFFF 
F 


7985 


21886 


A 


8045 


370 


14 


TPPPTKTPFFFLYPPPWREKREGFF*KA 
NMSPPVWKKKIPPLPPYIYL*KGGSRYL 
AKLSLNLKGSIYLSFCFPPLPPSLPLSL 
FERWGLTILPKLVLNSWAQMILLPWLPK 
VLGLKV 


7986 


21887 


A 


8046 


395 


268 


PFFFF * ETGSCS I VQVGVQWLFTGTILV 
LISVGVLTCSISDIj 


7987 


21888 


A 


8047 


379 


98 


GPAHNSPPLGGESETPLGYPGKPRFF*K 
NKKINPARGPGPWPPPPQG*AGKRPLP 
PRRGFQLTQNGPPPPPPGGKKKPPFQKK 
KKKREPAIC 


7988 


21889 


A 


8048 


369 


177 


RNTRGGPTFFFFFFFFK*NLARLSRLEC 
HGTISAHCNFRLLGSSDSLPLTSNDIGQ 
INKSLFA 


7989 


21890 


A 


8049 


1 


128 


ALGLVAHAYNPSTLGSQRGKIA*GREFG 
TSLGHRARP CL YKK 


7990 


21891 


A 


8050 


166 


163 


P VHQGEQTTQDKCLR * S THLGLP KG * D Y 
RRE P P C QAS KLVP TG I LAD S KHS KCQTM 
DSCFSNQLYKQR 


7991 


21892 


A 


8051 


402 


245 


YFKS VCQAQWLNT YN"P S TLGG * DGW ITC 
GQEFRLPWPSGITGASHHAQPHFF 


7992 


21893 


A 


8052 


404 


268 


QQLWRLRQRDCLNLGCGGCSEPRSCYCT 
PA*ATEPDPVSNKQINK 


7993 


21894 


A 


8053 


1 


154 


GFLRVGQAGLELLTLRDLPTSASHSAGI 
AGVI YRAWPGEAYLFSG * WLRFT 


7994 


21895 


A 


8054 


406 


150 


PKKKKNPCPPPEVKFDSLKRAPLFFFFL 
EESCFVTQAGLECSGAILAHCNLCLVGS 
SDSPLSL * RWVGNSHF I FTSLLS VS PRS 
N 


7995 


21896 


A 


8055 


1 


125 


TAVI L 1 1 FM I * EAFAS KRKVL I VEE P S I 

"NTT.T7*T VPPDBDTl-T 
JM Liej lj X \j c ir Jr ± ri 


7996 


21897 


A 


8056 


400 


260 


HMSPHLAFFFSFLFL*RLDMLPRLVSNS 
WPQAFLQLLLPQVLGLQV 


7997 


21898 


A 


8057 


396 


263 


FLQPGT VAHACNP S TFRGGL IT* AQ E FK 
TS LGNKVRP CLYKKQY 


7998 


21899 


A 


8058 


402 


211 


FS CWWCTKPKAAG* FPAP WGPPFLNNPP 
GAGTNFPGPPRSGGFGSPPPPWPGPRRP 
CPNGIFFC 
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peptide 

sequence 


Amino acid sequence (A=AIanine 
0=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylalanine, 
G~Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *— Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7999 


21900 


A 


8059 


I 


279 


VLLYHPGWSAWQSWLMQ* TPSNADI WR 
IARGFKRSSHFSLLSSWDHRHVTPCLAN 
FFLIL* IWGLAMLSRLLPLRLRKPVLLD 
*APSPCGLI 


8000 


21901 


A 


8060 


408 


315 


SLTMLPRIVFNS*AQAILLPWPPKVLGL 
QV 


8001 


21902 


A 


8061 


198 


2 


RIQHVESNCHLEVEGGWAQWLTPSTLGG 
RGRMIS *AQEFETRLGNTVRPCFYKNLK 
SQPGMWRI 


8002 


21903 


A 


8062 


3 


294 


GLSPTAHPITAFTSQPLPSGLSVCRLFL 
WAMCEQCPHLPIRPCCPRI IGPECALYT 
TS CLLHQLKKREGF* KKKKKKKKKKKKK 
KKKI F * WGGGKK 


8003 


21904 


A 


8063 


197 


184 


IHD FLFFF *GRVS LCCPGWSAAAQSRL T 
AL* PGELKETFYL* IRRSWDHKHAPPLP 
APF*LFWVTLIITPSTHEVSSAVCSLLY 
VKSEQKCQ 


8004 


21905 


A 


8064 


3 


373 


TSGSLAKRPADCLSAQPTTPSSLCGCSP 
VRVC I T VPRCTHARAHTHTHTHTHTHTH 
SNHG*APGALYIQELQRGKVSHVPEKEK 
SDPETA* TKGG* HVTNPHGRQRSGEQEE 
HRICVGRCCSG 


8005 


21906 


A 


8065 


532 


106 


ENKP I HNLS I NKY I GK I LRD KALL Y C P G 
WRAVMQS*LPVASNSWAPAILLCPGFLS 
TWDYKRVPPQLVKIFSKKKKNKPTNILR 
PKCHTRNQDIYKRGKVGFLCSYCPPALL 
LLLS LLLLLLLLFS FSTRD S AS GAGCS S 
LP 


8006 


21907 


A 


8066 


411 


130 


PHAQLIFLRR*SLTVFPRWSNSWPQAI 
LPPWPPKVLGPQARAMVPSPKLVLESIL 
RIP TS VI FFPLNL CSNI KLAE I KEGCS A 
AFLFNQRFAN 


8007 


21908 


A 


8067 


3 


32 


DAWADAWGSLTDE W VEK I WYIHAMQ YYS 
S FE KRE I L * YVRT CMNLEDTMVS EI RQS 
QRDDYCMI PHI * GRTRGVH 


8008 


21909 


A 


8068 


410 


116 


STFNIQTWPGTVAHTYNPNTLEGQAGRI 
T*GQELETSLGNI TRPNFLLFCQACPGQ 
GLSTVRMCGSLHYFVLPLRPLLKCLDNR 
KSPFYHPVILPCYE \ 


8009 


21910 


A 


8069 


158 


289 


DLCSSLRNIVRPQLYKK* KISQVRKKKK 
KKKKKKKKKKKKKKKKKKKK 


8010 


21911 


A 


8070 


388 


1 


APTPPCFLFKPFVLGGPPFGKAFFQKFP 
GGGFFPSISGPKPPPPRNPSLISKGKRV 
PSPQSPIMVFPGAF*RPPPFPLMVG*KI 
KIFLPPPPPFKTPPAIjSPFFFFEAEFRS 
CWP G W3STAVMRS RLS ATS 


8011 


21912 


A 


8071 


15 


336 


KLD KRYDRTG S P GT THAS G YL I E PL VAE 
ASYELILSLAFFFFERRGLALLPRLEGS 
GVIVAHCTLKLLGSGGPPASAS * VAGTA 
GMNHHEGHE PGL FE KQNS GF I F 


sm i 
o\)lz 






QA7T 




2 /4 


VILSPWPPKWVL 


8013 


21914 


A 


8073 


189 


54 


NKPLGPGMWQACNPTPLGGQGQWIP*A 
PEFKISLGNLAKPCFY 


8014 


21915 


A 


8074 


3 


257 


HAFVLLFETRS CLS LRLE S SGAM I AYCR 
LELLGS GDPLTSAS Q VAGTKGM * DDAWL 
RQVSNSWPQVILRLSHPHMFLFQFSLET 


8015 


21916 


A 


8075 


1 


119 


TLPATWEAGAGGLLE PKSLRV* CVFTAT 
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Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, HHHistidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon s /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VNSSRTPAWAA 


8016 


21917 


A 


8076 


232 


266 


LPYCSGQWCLGLPIIQGVQWHDHGSLQP 
RLPGLNQSSCLSLPKCWDYGREP*YLA* 
YQAR YHGSLARQNVFNI RPG I MVHARS P 
STLGG 


8017 


21918 


A 


8077 


390 


143 


LGGFLVFCPPAKKGFFPNPINLGTPRFF 
PPPPF*KPAPEINFGGPKKKKIPSPPPG 
VKFDPLKRPPPFFFFFFFFFFIRLLV 


8018 


21919 


A 


8078 


424 


144 


IjKNYI FFTFL *NRDRVS LCCPRLVLNFW 
AQE I LLP WP P KVLGLQA 


8019 


21920 


A 


8079 


182 


45 


VHKAGMVAHAYNP S TLRGRGGWIT * GQE 
FETSLASLDNMVKPRLC 


8020 


21921 


A 


8080 


405 


163 


YFLEEMGFHHADQAGLELLTIGDPPGGM 
SHCT WLNLNVNL I QKHPHRN I QNNV * PH 
ISGPVKLTHEGNHHNWHMSTQKHSP 


8021 


21922 


A 


8081 


39 


240 


QS KTVS KKKRKI I FCRYRVS * S PGFKQS 
SHLGLPKGWNYRREPPHPASTFSQSFPV 
KDHFCSGVSS 


8022 


21923 


A 


8082 


2 


263 


DC CVS SE * GS VGVGLS E PGAGYNLLVFR 
LLRPLEKRS IRVGGSRFSRYYLSWLPLA 
RKGNS PTPCTSQVRQCPTLLLGLHPLCD 
KPQ 


8023 


21924 


A 


8083 


1 


248 


SVGVGPSEPGA*YNLLVCCLLRPLEQHS 
IRGGVSQFSKYRQSWLPLARKGNSPTP* 
AS CGMRG PALL WL TLLG LHPL SNKPQ 


8024 


21925 


A 


8084 


3 


255 


HLSLPSSWDYKRMPPHLANFFFFFLGKK 
GVLP*GQTGLEL*N*KNPPA*PPKGAGV 
KGVTHRP KP * NGGKGRFE KDT PGGP FL 


8025 


21926 


A 


8085 


409 


145 


LRGP YKKNFSFQAPGRKIGS FKKAP PFF 
FFFFF*GQSLCCPVGLSTGVI IAHGSLE 
LLTS S D PAS S AS RVAGTTGACHCS * VLQ 
TFTF 


8026 


21927 


A 


8086 


199 


26 


GFPPPPNKTEIF*RPPDFLNFFFLVEMN 
LTS LS RLVSNP WAQAVLLP WP P KALGP Q 
A 


8027 


21928 


A 


8087 


222 


3 


SVGITGVSHHVLWYFTLKGFLGRGAHSC 
NNSALGGRGGW I T * AQE FETS LGS VARH 
HL YK5CYKN Y P VAWRL 


8028 


21929 


A 


8088 


415 


197 


TILCFYHQNNSVFSSNPFLSL* IFIF*D 
RVSFCHPGWSAWQSWLTVFPILCATVK 
YFTCQQYI IMI FALNS 


8029 


21930 


A 


8089 


297 


71 


FIFPENKNRPE WAHACNTLRGRGGRI T 
*GQEFEKSLSNIARPHLYKTKTQNILSK 
YIQYLGFQLTLQIDLANIY 


8030 


21931 


A 


8090 


185 


415 


PFGLKQSPHLTLLSS*DYKHTPPCRVNF 
FFFLGGFGFFPKAGFNLGN*RNFLASPS 
KRAGMAG INNLAGP I FRFFKK 


8031 


21932 


A 


8091 


2 


125 


RKQAS KKERKE * KK* RKKERKKERKKER 
KKERKEKERKEEP 


8032 


21933 


A 


8092 


8 


412 


KGPQGTTLTCIFDLSYSSSLSHWSQSLC 
VAFPLPECIFPHKPLCPACHFFFFQSWG 
LALLPKREYRGKIGVNGNLELLG*NNPP 
S GTS K I AE P AE L FHHTHARP FNW 1 1 KTD 
FPYIAQTGF*LRPSRNPSTLEV 


8033 


21934 


A 


8093 


249 


143 


N I FAATWMRLET 1 I LSEVTQG* KTKHQM 
FSLICGS 


8034 


21935 


A 


8094 


3 


280 


P ERWD YRHE P PGP AS FT * LLLFVFNL * R 
KGLYVGQRGRQLLASSSLPAPASLSAGI 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine J H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IRMSHCTPAWVTVRPCY* INK* INGKDT 
IRQNLKKM 


8035 


21936 


A 


8095 


404 


16 


QMGFCLFFLPSSKEVFFPPIFFFYRESF 
FL FHFFLDPRQY * L YF 1 1 KKN I FLPS P F 
IFFLFIRVSPYIFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFWGSMEL 
LSKAHQCISSHGSG 


8036 


21937 


A 


8096 


358 


200 


KSYFIFSLII FSHYFFLLFLKHLLATYY 
CFLIDCTLFFSFYCPFCCLFVLLSGRFP 
RS CL * T FH*RLYFS FP I FNVQKLF* LSD 
rCFCCIYLFIYLFIYFETVFCSCCLGWS 
MMA* FNGMFKDRI 


8037 


21938 


A 


8097 


2 


132 


KEEKKERKE*KKRKKERKKEKREKKRKR 
EGGRKEGKEGKNSST 


8038 


21939 


A 


8098 


48 


331 


PGTWHACLGGEDILKWQTKGRGRRAGSH 
RTS WI E P LET VFL ELEAGGRLE S PGKN Y 
SRLGAVAHICIASTLGGRGRQIT*GQEF 
DTSLADFCST 


8039 


21940 


A 


8099 


7 


373 


NGRLRRGFAMLARL VLNF * PRDPPT *AS 
QSAGITGMSHHARPTIESFIHYCWECKM 
E*PLCKLILYFHLFIFKRQKLGVHWCHH 
SLLQPPTPGLQRSFRLTPPSGWDYRRGP 
PCPANI*LII 


8040 


21941 


A 


8100 


116 


432 


YLRNLFYQRQAFS PAILNLMCS AL*ESH 
TVAQAGVQWHNLGSLQPLPPRFKRFSCL 
S LPS SNI FAPLS YFFRNS S FTNT SPSEI 
GPQITDPLFI KKKI KVS VMVE 


8041 


21942 


A 


8101 


361 


138 


FNLLGRKKKNFGGLKEGE S RRP PNPGPQ 
FFFFFLFFFFFSG*RWGLTMLPRLVLNS 
WAQVILLPWPPKSLGLEV 


8042 


21943 


A 


8102 


1 


199 


PTRP F VFLMEME F YHIGQTALELLTSGN 
L PAS AS QS AG I TGVS DR * LA* YLFF FHE 
LFCYFALLLI 


8043 


21944 


A 


8203 


412 


46 


KGGSPPPPPPPKIGRKGPPNPGDPFFFF 
SPFFFRGF*GSPFPPK*KVLLVGFLKPK 
PPFPLGWGNPPPPPVGGFSPPQSLQFFF 
FKKVFGWVFFFPLFFFFFFFSPFFLFFS 
ETESHHIAQA 


8044 


21945 


A 


8104 


192 


2 


EETELSFFIIKYYLPRNQKVCKRL*RRL 
GPVAHACTPSPLGGQREW I TRSGVQDQP 
GQDGETP 


8045 


21946 


A 


8105 


373 


183 


GGPF* FQYILILFDNFFFFSLSFPFSQD 
FFFLSSFFFFFFFFFFIINYFYFF 


8046 


21947 


A 


8106 


373 


281 


S FKTAP P FFFFF FW * RLSLTML PRMVSN 
SWAQWLLPQPPKVLRL*ATQRKKHFRQ 
EQI 


8047 


21948 


A 


8107 


3 


114 


FLIFVHIGPNYVAQAGLELLSSDDPPAL 
AS QCAG I TGMNHHARPYYLL * TLNIPLP 
YDPEIFLLGIYPR*MKTYVHTKACT*FF 
ITPLFVIAKTwKQPK*PSVLGLQA 


8048 


21949 


A 


8108 


416 


243 


PGVFPPPPF*NPPPEIIFWAPKKKKYFP 
PPPPLNFVFFKGPPPFFFFFFFFFFFFF 
L 


8049 


21950 


A 


8109 


2 


211 


S VTQAGVQWHVSLLQ PRLPGLEQ S S CLG 
L**CWDCGHEPWHLALVTTSKHTESCSL 
LCI PTATFLGLVT 


8050 


21951 


A 


8110 


172 


28 


HWC F I T S F FGF YFL * RQGLTKS P RLVP 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leu cine, 
K=Lrysine, L=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W— Tryptophan, Y~ Tyrosine, 
X=Unknown, *=Stop codon, /=possibJe 
nucleotide deletion, \=possib!e 
nucleotide insertion 














NSWV*VILLPWPPKVLGLQA 


8051 


21952 


A 


8111 


2 


225 


QKSAKNFYHLS I YLS I YLS IYPS IYHLS 
IYLSIIYLSSTYE*VIILEQKWNSIKKG 
CL* CT IQTNCL * QLKE * VIRL I P KL* YK 
I KLPTCWS VHLNWFTLCNI S WYLKYLAM 
* *RKLVCFA* FVVYNSNKLLITIKRIGH 


8052 


21953 


A 


8112 


392 


221 


PFPKRES PPDKVNPPPPDI FGGGKTKKK 
TLANN* FPPKEKEFFVFFFKKEDFF*NI 
VGNFKTPS KI * AVCF *NP VFKKKKGAPK 
KKSSFKNF 


8053 


21954 


A 


8113 


369 


112 


IFFSTIYYFXHPFFFPPPLFFTPPPLFF 

FFFFFFFFFFFFFFFFFFFFFLAF*HVY 
SF 


8054 


21955 


A 


8114 


418 


248 


KYYLPTPGY*NLILLNGPPFFFFFFF*D 
RVSPCCPRLP*TPRLTQSSCLSLPRSWD 
HRCMPLRL VTF * FTFKLSHQS LRTLLTY 
TLAGTTLG 


8055 


21956 


A 


8115 


229 


78 


S FKGAAPFFFFF * * RQNLTMS PKLVSNS 
RPQAVLP P WPP KVLGLHRGGW I T 


8056 


21957 


A 


8116 


356 


66 


PLFPFFYIKNFKKKFFFFFQYFFFFPPF 
PKHYSHKDIFFFFI IKFFPSSSFSFPPF 
FALSSFFFFFFFFFFFFFFFFHA*FPVF 
FLYKI PMLYPQL 


8057 


21958 


A 


8117 


212 


81 


ERLSHCSLNLPDSSNPPASAS*VAGTSG 
THQLVMWHAGS PRF 


8058 


21959 


A 


8118 


367 


2 


GVFFFFFSPAKKGVFFHPFFFLGPRFFP 
PPPFFFPPLWFYFWGP * KKY* FPHPRGY 
EFFFF*GGPPHFFFFFFFFFFFFFFFFF 
FFFYAEGIGVSPCYIMLGYNFSSFPCGT 
ISPRVRPRV 


8059 


21960 


A 


8119 


38 


215 


FFKSAFHHTRLIFNFFFFLEKTGSHYVA 
QAGVELLG* SSPTTSAFEIGGITGLFLS 
FKN 


8060 


21961 


A 


8120 


266 


2 


PQIFLFIFETRSHSITQVGVQWCDHSSL 
QPQNPGLKQSSLSS *DYRHTLTAPG*FC 
FL*RCGLAMFPMLVLSSWPQAVLKIHPC 
CGMC 


8061 


21962 


A 


8121 


3 


332 


DAWADAWGHVP P C P ANF VFL V * TGFLH I 
GQAGLELPTLSDPPASASQSVFIYFLET 
ESHSPAWETERDSVSQKKKKRGGRFKGS 
NFTSAGLQRFIFFMGPPKLISRAGV 


8062 


21963 


A 


8122 


378 


66 


LFIPLPIREEVFPCIPG*NWAPPGVLYN 
GRPFFFFLLETVFLHAGQAGLELLGSSD 
L P VS AS Q S VG I TGM S HRVQP KLR I F VN I 
SFPFEIFCEIMLLVPLAQP 


8063 


21964 


A 


8123 


2 


316 


I SMLLALLLRI ITFRLPQLNG YVEE FTP 
YECGFNPISPGRAPLFIEHFLVDITFLL 
FDVQIALLLPLP*ALQTGNLPLIVMPSL 
LLIII LALRLDYERLQNGSD 


8064 


21965 


A 


8124 


331 


14 


XTSILLIIITF*LPQLNGYI*KSTPYEC 
GFDPIS PARVPFS I KFFLVAITFLLFDL 
EIALLLPLP*ALQTTNLPLIGMSSLLLI 
I IIiALSIAYE*LQKGLD*AE 


8065 


21966 


A 


8125 


2 


228 


IIITF*LPQLNGYIEKSTPYECGFDPIS 
PARVPFSIKFFLVAITFLLFDLEIALLL 
PLP*ALQTTNLPLIVMSSL 


8066 


21967 


A 


8126 


451 


200 


KA*WNRFAIL3WTKYTCESQEEVKIPTF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F~ Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Ijysine, L— Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine 7 S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AGVWRKL I PAHTDDFVHFKTS VQEVATD 
VAKIAS LLE LEVE SEDVTE * LQSQDKT 


8067 


21968 


A 


8127 


2 


396 


LIVIINTLLALLLIIITF*LPQLSGYIE 
KSTPYECGFDPISPARVPFSIKFFLVAI 
T FLLS DLE I ALLLPLP * ALQTTNL PL I G 
MSSLLLI I ILALSLAYE*LQKGLD*TES 
GWEKLVKKE KADKLKGGRP 


8068 


21969 


A 


8128 


1 


350 


GTRGLQTLLPL I AF * * LLAS LANLALP P 
TINLLGELSVLVTTFS*SNITLLLTGLN 
ILVTALYSLYIFTTTQ*GSLTHHINNIK 
P S FTRENTLMF I HLS P I LLL S LNPD 1 1 T 
GFSS 


8069 


21970 


A 


8129 


375 


1 


LTCFS EHRP P PRAKTFFFSRKTRAP FFL P 
GGFQ I PGPNFFAP P I FPNGGP I PGLDPQ 
PGPLNFF*KSFFPDPKFFFLFFPPPI*N 
SGVFLPRNPVFPKTPNFLFFFSPFFPGD 
GVLLCHPGRLPRA 


8070 


21971 


A 


8130 


32 


211 


GIRHGFAMLPRLVLNSWPQAILLPWPPK 
LLG*QAGVEIiLASSNPPALASQTAGMTG 
VS TRP P VLLPTLGTHLCLDACLGS SL 


8071 


21972 


A 


8131 


396 


5 


KYRDGPPPPGKFFFFFF*DGFSLCCPGW 
SQTSCPK*SSCLPEG*DYRHEPPHMTLS 
S S Y * CS QMQKCIJSTKNCLGS CWAHACNL 
S P LGGRR I TWGKKFE T S LDN I VKNRLKK 
KKKKI YQGGGDHP CIFTN 


8072 


21973 


A 


8132 


347 


34 


KAGPVI PRGPQEFFFFFFPQDGILL.CFP 
GWRAVAKS * VTATSNSWAQAITGMLPP C 
PANF*KI S * RQS I SMFPRLVPNS WPQVI 
LPPQPPK*LGLHVCLVPSLA 


8073 


21974 


A 


8133 


102 


270 


DYRRTPPHLANFFYFL*RQILTVLPMLV 
SHSWVQAILLLQDFFSVQLKMGFLSVP 


8074 


21975 


A 


8134 


338 


47 


FLS TQKES S PLHTHTHTHTHTHSHLLTH 
HHGRGQ E RGKRGR * GG * WRTGTHS PHG P 
HCVRLRKSRPSGELGAAQGGCSLIQPKI 
EDPGQSGGPREVP 


8075 


21976 


A 


8135 


68 


467 


SLLNRIPFCLCLNSFDCVFLLGLLFEAG 
FCS VAQAGVWWYEHGSLQP * PRGLK* SF 
HLSLPKFWDYRNEPPHPPFSCFKALNFK 
DYCDSGTVPCPFMKELSMLFIHALTESF 
VLFIVMSKLITTKHLTIISNV 


8076 


21977 


A 


8136 


500 


355 


FHH V G QAG I Q LLT S GD P P A* AS QS AKI T 
GVSHRARPQTSFLNKLPSLR 


8077 


21978 


A 


8137 


58 


222 


I YG YEGRGS VAGSLS SL * SATTDSDLDY 
DYLQNWGPRFKKLADLYGSKDTFDDDS 


8078 


21979 


A 


8138 


345 


196 


KGDQAQWLRPGMVAHI CNLSTLEGQGGR 
IT*AQKFETSLSNTLRPPISFK 


8079 


21980 


A 


8139 


500 


355 


FHHVGQAGIQLLTSGDPPA*ASQSAKIT 
GVSHRARPQTSFLNKLPSLR 


8080 


21981 


. A 


8140 


3 


231 


HASADAWGRNEWPCAVAHAHNPTTSGG* 
GGWIA*AQEFEFETSLCKIVKPHLSKKK 
KKKKJN b & PG WGIb P P C I PLF 


8081 


21982 


A 


8141 


416 


26 


ARIFPNNSFWPGQGGFPFPPRPLEGPGG 
VIP* CQGVQARPAPPGE PRLFPKPQKI V 
GGGGEPPLFPPPERAGPQKCRPFGEGGF 
N* P KPRP CP PGWGAQGGF VS KKKKKRNE 
DS KI PL IGKYPVGIRNMA 


8082 


21983 


A 


8142 


413 


35 


RGAGGGGSPPGPRYPSQQHLLRARTPTP 
FARSRGSFPGVRGSGWRGGLGG* SRERP 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine ? D=Aspartic Acid, 
E=Glutamic Acid, F=OPhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=JLysine, L=Leucine, M— Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
r=Threonine, v=Vahne, 
W-Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














LPSLPPRALPRSGASSGPLDGGAQGPGR 
RERWAFLGGIRADARGCRAVTSPEPYRL 
PGAGAARARR 


8083 


21984 


A 


8143 


166 


368 


ENILWAAMWLDCHPSTLGG*GGRIT*G 
QEFKTNLANIERPHLFFFFFFKRDSLFA 
PRAEGGGPILF 


8084 

• 


21985 


A 


8144 


1700 


621 


DLRDHS VLQQGPACLTRCGEP I LSHSEG 
SKIGSERESGEHSRHPEAPPPSVEVGSR 
RS * GQDS *GFGPGNGQDTRGPLSHAAPN 
WTPDPAAAVAAGGTSLP PRGGGGI * RGR 
SHTARSGARVGSANAHARGGSGFSRHSG 
SHGCLRNQSPRLGKAGALCGSAWVSGL* 
GSGLTS PRGRRERLRMPG VAKGRPR *GN 
PRRGRG KQHG I PAS GL PGRCWLG I KMS T 
LQPRTRAPEELFVPQES SGSTPEKMLVS 
FHG S S LRNE AT PR YS QQ E E AGNGRWQ Q S 
LSLERWPPWTSHPLGTPPLMPVAVARCC 
ILPGLWPLLSPPSGSASPVSQGPGCLSL 
WPNAFKKDDFPVRQGDTSWRSVS 


8085 


21986 


A 


8145 


390 


38 


NTTTFNIYFFYFFFFSIiFIYFFFFFFFF 
FFFFFFFFFIFILFIYSLLVFGVWVLGL, 
GLALALLAKLFLVNSLCRRYRG* SLLYY 
AWL*FFIFPLRYYIYCARFQFLSPILYL 
GKWFG 


8086 


21987 


A 


8146 


20 


187 


KLTTS TGRNPGRPTRPNF * VF * RDGDL I 
MLPRIiVSNSWPLVNIjLPWPPKILiGIjQM 


8087 


21988 


A 


8147 


417 


64 


VRPRVRPRVRPRIRKKKKNKKKKKKKKK 
KKKKKWGGGF * D FWGGE KGP P PLQKKDF 
LLFWGFFLKSFFFFCGGLFFWGGPFFFP 
PPPNFLGVGGKKKTFRLLCGP FSAFFGG 
EKN 


8088 


21989 


A 


8148 


351 


187 


RLGTFYLSNILRFYLSITHGPIPYIGRL 
TSQNLPLLLLDLT*LEKLLPKKSQTYN 


8089 


21990 


A 


8149 


465 


60 


P PH F WGNP KKKNS Q F FWAKKKKKI FLG P 
* KMVAKKGWFWGRKKGFQRGGKKG* KG 
GQLFYPVLSQVLGGGQNHGPPGKAGFGE 
NFFFFGQGLGWAI SRWLRTPLFLAGGGE 
D P KKN F KP P P K I KT LGAGVKKK 


8090 


21991 


A 


8150 


1 


175 


WD YRHVP P YLANVF F VI FRGDKVS L C C P 
V*GPTTEL,KRSSCLGFP*GWDYRREPPH 
LA 


8091 


21992 


A 


8151 


59 


35 


RWNSRPRRPGLKFLTSGDLPASASGSAV 
ITDVSHHAWPETFFR*RA 


8092 


21993 


A 


8152 


269 


159 


ARVEVSKKYIYIHTHTHTHTHTHIHTHT 
HTYTTKKLPS*LFQVCKMEKFGRLWQ 


8093 


21994 


A 


8153 


1 


256 


PGPPPRFFFFFFFFFFFFFFFFF* TLKI 
RFIWRKQKTHNPKDGILHQDISQVISLQ 
I TNHFHRR YF CTN FTC I QERD I NQ S L FL 
F 


8094 


21995 


A 


8154 


252 


41 


LIIPVLICIVFIVLQFKIFSVISLNKGL 
ILPPWPPKVLGLQV 


8095 


21996 


A 


8155 


21 


443 


HELSLI*FISTLAETNRTPFDLVDGESE 
L VSG FS I E YAAG P FAL FF I GE YTNT III 
NTLTTT I FLGTT YDALSPELYTTYFVTK 
TLLLTCIiVL* IRTAYPRFRYDQLIHLL* 
KNFIiPIiTLALLI*YVSIPITISSIPPQT 


8096 


21997 


A 


8156 


2 


205 


LLHLMKHFNLE I QKAQ * TR INLKRS I SR 
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Predict- 
ed end 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
0=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G-Glycine, BNHistidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine 7 R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X—Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HVI INLSKAKHKG* ILKAVREKMLIMYK 
KPLVKLQVISN" 


8097 


21998 


A 


8157 


2 


205 


LLHLMKHFNLEIQKAQ*TRINLKRSISR 
HVI INLSKAKHKG* ILKAVREKMLIMYK 
KPLVKLQVISN 


8098 


21999 


A 


8158 


2 


222 


VYTWGTGKARVFPGEF* LPVW I PTRHLK 
FYNEPIRDANESASAETENPQSSIIHPQ 
DE QNGD I RRTD KVT I HQ 


8099 


22000 


A 


8159 


1 


297 


GFHHVGQSGLELLTSGDPLASTS*SAGI 
TGMSHHAWPLNCSYGDSFSWLKCPELTV 
ACWWEEGWRKLLMENEQGREAHQKLESV 
INFYIKDSMTKRKYK 


8100 


22001 


A 


8160 


138 


437 


NEYDHFSIIKERRDFIVCVFFFFFSKKS 
LTVGTQGGPSYLEPPFLGNKEFSCLTFL 
GSWDNGGATTRLG* FFVFFRKKGVSPCG 
PGWS*TPDLRKTPPSA 


8101 


22002 


A 


8161 


330 


148 


QQSKTPFFFFFFKKKKKYIYICMWHAY 
NP S YS GG * GGR IT* AQKF KT S LNNT VR P 
HLYKK 


8102 


22003 


A 


8162 


391 


255 


RPAPADF* FFFF YRLGLAVLPRLVWNSW 
PQATLLPQPLRVLGLQI 


8103 


22004 


A 


8163 


131 


3 


HKVSLCYPG*SALMQS*LTVASNSWAQA 
ILQP*PPE*LGLQA 


8104 


22005 


A 


8164 


97 


440 


NKGSLCPPGGRPGALSHPTARSRGSPGP 
TPQKTGNNGPGPPTPVNLEFFEKRGFTP 
VGQGGLKLRT*RSPPPSAPQKAGITGGS 
RRPGP IQKKFLKTLLGETKWMAKLIKTY 
LK 


8105 


22006 


A 


8165 


541 


295 


FSQQKKKIFISTNLSHQESKIYICKEME 
SHSVIRLLEGSGAIMNHCSLKLPSSSDP 
PASVS * SAGI IGVSQHLANYTFKKI F 


8106 


22007 


A 


8166 


440 


259 


PPQKFFFFFFFVKTGFRHW*AGFRLLD 
S SD P PPLAFQSVE I TGMS P S ARPMFGVF 
HSLR 


8107 


22008 


A 


8167 


23 


405 


FRI CHLLRVHLL VY F I P S FTDNHLLST S 
AMFLYSNDWIAQNVSGPFLGI I FRTFF 
FFFLKGE FFFAPQVGGQGGNLG* LNPP P 
PGLKGFSGLTP PGGGNS GGGPPNKPNFW 
I FKKGGGSTLWPR I I 


8108 


22009 


A, 


8168 


23 


388 


PYFS C AL P RHH PPT TNLGPHLGMGAPWG 
QPSVAHHTLLFFFFFLKRDFCFVPRVEK 
LWLEE S S LNPPLLGLKE FS CLTLWRTGN 
NGGPPPPPVIFCF*KKGGVYPWGPGGGE 
TPALKETPGL 


8109 


22010 


A 


8169 


612 


498 


NIHFS LS FSHTHRHMHTHTHTHAHPTY I 
RTP*DNVPQ 


8110 


22011 


A 


8170 


2 


288 


VNLTFFPQHFLGLSGMPRRYSDYPDAYT 
T*NILSSVGSFISLTAVILIIFMI*EAF 
ASKRKVLIVEEPSINLE*LYIKKKKKKK 
KKKKKKKKKKN 


8111 


22012 


A 


8171 


381 


101 


QKSLQQPSNIATILEEVQVIFLGSPYHL 
QRWEMRLTKWLRGLGAVAHACNPS TLGG 
MGGWIA*AQELETSGDPPPSSNRQLFTG 
EHNNDNKLI 


8112 


22013 


A 


8172 


136 


360 


TKKHFGLLK* * TKKHFGLLKKKKKKKKK 
KKKKKKKKKKKGGGAFKKKKFFPRGGGK 
I FFFFGAPKNKFGGGVLKTGGGKKPGVT 
KK 
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Predict- 
ed end 
nucle- 
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location 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~AIanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, D=Leucine, M==Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S-Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X— Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8113 


22014 


A 


8173 


358 


58 


EKVSVFFLKMKCSGIILLPCRLTLPGST 
DSPAS VCRMPAIAGPRRHA* LVLVETGF 
RCVGRAGLQPLTASDPPTSASPGAAIAD 
GVSFTQCSMVPRIRVQ 


8114 


22015 


A 


8174 


254 


370 


GLTVLPGLVSNSWAQVILLSWPPE*LRP 
QAYATGAGFLL 


8115 


22016 


A 


8175 


48 


269 


FYVILCLRDRVLLCCLGWS *TLGLKLSS 
YLSLLSSWDLQHVPPHSASCLTF*VKAC 
FFLLKYLKRLKIYAKIM 


8116 


22017 


A 


8176 


1 


338 


PTRP INTLLALLL IIITF* LPQLNGY I E 
KSTPYECGFDPISPARVPFSIKFFLVAI 
TFLLFDLEIALLLPLP*ALQTTNLPLIA 
MSSLLLIIIIxALGLAYE*LQKGID*AEF 


8117 


22018 


A 


8177 


426 


130 


QSLDPPGNMVTLSLQN*KNPPP*GGAVF 
PPPGRLGGENPPGGSKEQKRRPAPQGG* 
P RQ P P S P KKKNAD S E FS I FNLQ PALL C K 
Q YC I VLHAL PNK I L F 


8118 


22019 


A 


8178 


396 


244 


KIHLL Y I CMC I Y I YV I S RH Y I LR * HTHA 
HTYTHTHTHTHTHTPKNTKYTLW 


8119 


22020 


A 


8179 


2 


119 


TRPGNWPGWAHFYDPSTLGA*GELIT* 
GQEFETSSSLY 


8120 


22021 


A 


8180 


2 


80 


FLPLTLALLI*HVSIPITISSIPPQT 


8121 


22022 


A 


8181 


438 


333 


S PLGRLRH YNCLNWGGKGCN * P KS CP CP 
PAWGTK 


8122 


22023 


A 


8182 


393 


19 


IFFFFFPPPSPAKFFFIFKKKRFPPLWG 
GVF*SPPPGFQKTPPPQKVGFSRVFPPP 
PPKKFFFGGGKYFFFKEPPPPPPPIL*K 
GGAKISPPPPPGKKKNPFFLKGGKKKKT 
PFIFFFPPKKIKN 


8123 


22024 


A 


8183 


24 


269 


GSQQVLGFCFRDRVLLCHPGWSAVM*S * 
LT VAS T S WA* R I LH C S LLGS WE YRHT P P 
CL*NRNPQLTVKWRVKDHNNKKSHR 


8124 


22025 


A 


8184 


154 


361 


WHNHDPLQPQTHGLKRSSHLGLPS I WHY 
RHYPPRLANFLNLS FRQGLAMLPTPSVC 
HLLVFLS *HIYWTFGYLS*SVCSHLLNF 
L* * WHNHDPLQPQTHGLKRSSHLGLPS J 
WHYRHVPPRLANFLNLSFRQGLAMLPTP 
SVCHLLVFLSETLGFK 


8125 


22026 


A 


8185 


228 


337 


RPGMVAHACI PG I LGGRGGWI T *GEKFE 
TSLANMVK 


8126 


22027 


A 


8186 


314 


127 


DIQPKNEGEGEEEQELWLGMTAHAYTPS 
TLGD*GGRIA*VQEFETCLGSLARPPSL 
QKNFKN 


8127 


22028 


A 


8187 


10 


420 


TQEAEVAVNWDHATALQPGRQSETPSPE 
KKKKKRG P PR WGENF F P TPGG F * KDGGN 
KKTGFP P FKRDGP P P P KNWT P WVFQKRG 
APPEPPPLGGALGREPLGGFFGKGGVET 
PPPP I FKKNNPPEKKPHRGFNFFAF 


8128 


22029 


A 


8188 


398 


1 


LKKKRPQTPFSPKIQF*KFFPPFPFKKS 
PFPKNLLKKKRAPGGPIFFFFFYPGLFK 
NPSPL*MGSPPPPLFNPKN*KKPPLLKK 
GPGF*WPFFQKKKPPFFFFFFWRQGFTV 
LPRHTP S CLVNFC I FCRDGX 


8129 


22030 


A 


8189 


30 


153 


AW YVRRE DHVS LGG * G CGKP WLCHGT PA 
WVTEENP KGKI K 


8130 


22031 


A 


8190 


206 


45 


HPSQNSLSKN*ERVSVCCPGWSAVAGSQ 
LTAAS T S QAQ VI LLPQPPE* LGLQA 


8131 


22032 


A 


8191 


3 


228 


TCVCLCVCVY I CDVCVGCVCVYIDE Y I Q 
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peptide 
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Amino acid sequence (A=AJanine 
C=Cysteine 9 D=Aspartic Acid, 
E^Glutamic Acid, F— Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=JLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \-possible 
nucleotide insertion 














I YR Y I HT Y I D I YTH I * I Y I HT Y I DTHM W 
VCVYI YI YI YRCMGVRET 


8132 


22033 


A 


8192 


41 


416 


I CRVKE I ASRPLLE THSLS LVELS *FCL 
TAQ * RND FLiAL KHLTAVL YNL I LHS S G F 
WSSLKSHF I FLAL * Y YL* * F I * SN*RRD 
LAVLPRLVSNSWPQVILPPQPPKVLEIR 
YEPPSLALILFFN 


8133 


22034 


A 


8193 


2 


183 


KRLSIWQKKVEGEGFQGMDVGPVSVAHA 
CVS S S LEGQGGKI A* GQEFET I LSNI VR 
PHLY 


8134 


22035 


A 


8194 


2 

• 


465 


AGVQWHDLGSLQSPPPGFK* FSSLSLPS 
S WDYRS AP P CP AD F C I FRRDWAGWGVGC 
G FHR VARAGLE LLTS GD P PAS AS P GAG I 
AGMSHRTQRGQDNFASWRRGGPQMSPSP 
PSCLFLQNTFSLKPSWVAPYPHASALGG 
GEYGGPPSPQPHLE 


8135 


22036 


A 


8195 


397 


149 


L I CLTDLP IRFLSLLALVFTVFNTKVKV 
KDLIKLGWAHTCNLNTLGG*GGRIA*A 
HEFETSLGNIVRAVIPVCCFSVILSR 


8136 


22037 


A 


8196 


355 


95 


KKIMPTLCLKIPS*YRCEEIIQL,SLQSS 
CDHRHMPPHPTNPF*RQHLSMLRSLVLN 
SRPQAVLPHGPPKVLGLQAPATAPGXIiS 
LF 


8137 


22038 


A 


8197 


382 


112 


KTDFGKKGSPSSPPPFFFFKKSFAPSPR 
GEKKAPIiKSQRGLCPPKPKDPPPPGSG* 
TQDKGPPPPGQP I FFFFLKKGVCLFSRG 
GSKTFI 


8138 


22039 


A 


8198 


32 


361 


AS R I GF I LQE FL P VR VWTTNP S AGADG Y 
NFLEKQKGTEKTS C S VAQAGAQ W CHRNL 
L* P*TSELKQSSCLSLLSSWDYRMYFVL 
LGKQVTLLIGKRALSVYPRDWSLC 


8139 


22040 


A 


8199 


1 


314 


INTLLAIiLLtl I ITF*LPQLSGYIEKSTP 
YECGFDPIS PARVP FS I KFFLAAI TFLL 
FDLKNALLLPLP*ALQTTNLPLIGMPSL 
LL 1 1 I LALSLAYE * LQKGLY 


8140 


22041 


A 


8200 


3 


109 


FTS KHHFGFEAAA* YWHF VD W* LFLYV 
SIY* *GS 


8141 


22042 


A 


8201 


3 


216 


DAWAAVGHHHTRL IF* I FVE IGSLCVSQ 
AGLEILDSSDPPSSASQSAGIRGMSHGA 
GQEHLFHRPIGKIK 


8142 


22043 


A 


8202 


1 


267 


RIHTGEKPYKCEECGTGFKGPSTLTTHK 
FFVYCRE VAVLLKNCYSHLYPH* I XVNG 
SEADKREMCLLCIFFHPLAEEQENVENK 
RKYNP 


8143 


22044 


A 


8203 


422 


92 


PLVFALPPKIGFAPTTPSSSPPPPPLFF 
FGPPVFPPPPFFNPPPFFFFSPPPKKKF 
PSPPPRGKIFPF*TPPPPFFFFFFFFFF 
FFFFFFFLFLYYLFILC 


8144 


22045 


A 


8204 


445 


260 


VCS PPLF I WVAPGFS PPPLFKNPP PNFF 
FGAPKKKNFFPPPPA*NFFFLKGPPLFF 
FFFFF 


8145 


22046 


A 


8205 


103 


248 


GPGWAHACNPSSLGG*GQWIA*AHEFK 
TSLSNAVRP I SAKKKKGGPF 


8146 


22047 


A 


8206 


407 


197 


Qhl FKFL * EQASHYVAQAGLELLGSGMP 
PASTSQSVKVTGMSHGAWLLAPTLRSTS 
LNYNSLSVLLLPRP 


8147 


22048 


A 


8207 


1 


142 


S CS VAQARV* FRDLGSLQPPPPRFTPFS 
CTPAWETVRLHLRKKKKKN 



1049 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalan ine, 
G-GIycine, H-Histidine, I=Isoleucine, 
K— Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 


8148 


22049 


A 


8208 


2 


370 


PEVFDYPN* FKNL*LHKTCTQ*L*AALF 
VI TKTWNQGCPLTGEW INKLE YIHTMDY 
LAI KINELANHKMM*RTLKC I LLSERS * 
SRKATYYMI S I IRHPRKGKTI EIVQNS S 
CQGLGERQKGL 


8149 


22050 


A 


8209 


363 


108 


KKPRVEACLS PGVGRERAKRER*KRERE 
RERERERERENEL*KLSLADLCIKNG*R 
EFFKEKRYVKRYAKILRVSGRKQKYG3W 


8150 


22051 


A 


8210 


386 


254 


LNPGGGVCSEPRLCPCTPAWVIR*DSVS 
EKGKKKRKGPENSLTATOF*ALSCNLDY 
LRCSMLKMQRQRKRKERTEVRFWQLHHN 
LTLGFV* L I VYQRE *RRTGWF VAKYML Y 
TFS * PGSFPRLRKCVEHVFGYEPACS PL 
FPLIHNQLHKAKC 


8151 


22052 


A 


8211 


3 


131 


AHLF I AWFTE YFKP TVI T YCS KKKI P FK 
MLLLIDSAPSYPRALMEMYKGMNWFMH 
DNLTFI LWPMDQ * VTFTS * S YYLRN* FC 
TIAAIDSDS SDGSGQSNLKI F* KGFTTL 
DAIKNICDS*EL 


8152 


22053 


A 


8212 


86 


286 


WSIWECTVISVQLFHKPKTSFKNTSYWS 
GVMAYTNNLSTLGSQDPQIT*GQEFETS 
LANMVKSCLY 


8153 


22054 


A 


8213 


393 


344 


SFPFSLPSKWGFKNLVPSPGYFLFFLNK 
GFRYFGQGGFKILNSNYPPPLASQKGGI 
SGISPWPRALVFF*WRNLGPN*RTIVSP 
NYGLK 


8154 


22055 


A 


8214 


3 


313 


QGLVLWHSHGLLQP*PPGLRRSSCFSLP 
CSWEHRCTPPHLANFLILCVCVCVCVET 
GSHFVAQAGLELLGSSDPPLS ISQKSWD 
YKV* AI I PRAKQNYLMLFY 


8155 


22056 


A 


8215 


165 


348 


RPCPLFPEDLLTFSREGPGTMAHACNLS 
TLGGQGGGIA* GQE FETTLGNI VRPHLY 
FYLKK 


8156 


22057 


A 


8216 


379 


218 


VFLVETGFLHVGQAGLKLPTSGDPPASA 
S VSAG I TGVS QKCR F * YNVF I FQKS S L L 
NALP I CS LTMLLGHLFSMFMTPKFFTCS 


8157 


22058 


A 


8217 


32 


276 


RYLPIIAALFTIAKI*KQPKSSLLDKWI 
KKLWC I YTME YYS ALKV I LT WTTWMNL 
VGFIPGDINHIQKAKYCMISLICGI 


8158 


22059 


A 


8218 


171 


2 


KGVFGLGRLFFFLDRVWLCLPEWSTWQ 
SRLSATSAS*IQAILPPQPPEELGPLCP 


8159 


22060 


A 


8219 


530 


73 


TKPTVGSNSRSYINQTQYFPSTLTFGGL 
MRLTI YHFFFLLVYYNPRVMVTPAYM* L 
LIL FYS S S IRDLL IAKQ P CL PNEGFS PH 
AC*TNATLLFPYS I * YLGLPSFHLIFFV 
CFSFYFL*R*GLTMPPMLVLSSWPQVIL 
P PQ P P KV LGLQA 


8160 


22061 


A 


8220 


350 


135 


F F F F S QTGVAQNRVQ * RDHNS LQ P * P PG 
LRQSSCLTLPKCWDYRCKPLCLASWIYF 
KDKVYTSTPVHTQEY 


olDl 




2± 




1 

Ioj 


n i 


SLGNINSVPKKRRKRRKEMM 


8162 


22063 


A 


8222 


73 


232 


YGSFKFITLTCLSWLGTVAHTCNPSTVE 
GGGGRIT*GWEFTTSLGNIVRSCLY 


8163 


22064 


A 


8223 


418 


211 


AMPSLLTYLLTF I FCRKKKI FFYWGEFT 
MLS *LVLNSWTQVILPPWPAKVLVLPRL 
ALFYSLLVLFPSF 


8164 


22065 


A 


8224 


227 


33 


ESHSTTQAGVQWHDHGSLQPQPPGMSLY 
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Predict- 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIa!anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CH* CLRFWSWRPAEKPGWCSSLRFMELE 
SWSGASDAF 


8165 


22066 


A 


8225 


244 


287 


TDSPTSAFRSAGITSISKHTLPKSTF* I 
FCRAGGIiAMLPRLVIjNSWPQVI LPSQPP 
KALGSQA 


8166 


22067 


A 


8226 


53 


235 


NNKWQGWAHTCNPTTLGG*GSWIT*GQ 
EFETS PANMAK I P PL.VMFAF CFEETDNK 
SHTK 


8167 


22068 


A 


8227 


275 


25 


GVLMFNFKFCI * QNS P FMLP G WLNS W X 
PAI LP PQRP KVLGLQAWT I LYKWY YGKE 
MWHKE I CEGTHDVLSKLLTTRRFKRHL 


8168 


22069 


A 


8228 


386 


211 


QLKLPPBKGNKLRPP**KFFFFFFFQRQ 
GLTILPRLVSSSWHQMIFPPQPPEVLGL 
QA 


S169 


22070 


A 


8229 


1 


117 


VFIL F YF* DGL AML PRLATNS WAQA ILL 
PSAPKVQGLQA 


8170 


22071 


A 


8230 


356 


26 


WHVHGSQQP*PPGLKQSFCFSLPSSWDE 
RCAP PHP INYFQ I CRDRVYKRWANMVI S 
PNPLVLISQSAGITSKIPCPGSEISTIF 
IFSHQLLLVRGVYPDMLVLQLGKTCI 


8171 


22072 


A 


8231 


3 


188 


FP S LRPA* IFFYIYFF FLKNKVFLGHPG 
WTQLLD PNWTQAAGLKQS SHLS LLS S LN 
FRGSA 


8172 


22073 


A 


8232 


2 


253 


KLDYINCGTNLAQKCLFCLFGSLFFQR* 
GLTLLLRLFMNSWPQAILLP*PLTVLGL 
QATMLGSQVPVFDLGITLCRCRFGKLE 


8173 


22074 


A 


8233 


373 


65 


PEKAGSGLELAFGLQMQHPPWASTLPAA 
LTD FGLAT VPNHVS Q FL KS HTHTHTHTH 
THTHT CTRTLL VLF L * E ALTT T P LLRS I 
CKHRESACSLSLTWPLCC 


8174 


22075 


A 


8234 


390 


46 


RVFKPG*NPPLVKAPAPPLFFGGV*GRS 
PKKKNPQGPGGGGPPPGFHPFFLTPFFP 
IFLRREPPPFFFRGGLGPPPPPFKTPFF 
FFFWVKGGKFLPQKKKKKFFQS ISTRS V 
IHK 


8175 


22076 


A 


8235 


387 


216 


QKLGVGLVPPGRVIFFFFFFFCC*GGGL 
TMLPRLVSNSWPLAILLPQPPKVLGLQV 


8176 


22077 


A 


8236 


1 


201 


RFFFTIPPQCFPQFILSF*RRGLAVLPR 
LVLNPWPQAILPPQPPELLGLQAHPTTL 
GPVFYSYCVRA 


8177 


22078 


A 


8237 


399 


229 


S VGGGGLTMLPSWSQTPGLK* SSHLSLI 
KCWDYRPEPPHLAHFSLLNLQLYCHTKL 
S 


8178 


22079 


A 


8238 


192 


335 


SHTLPHSSNFS YFW * RQGFAMLPRLVLN 
S WARAVG L PQ S P KVLGLHA 


8179 


22080 


A 


8239 


442 


254 


GRDRVLLFCPGWSQTLGLKRS S CLGLPK 
HWDYRHE P PHPAEN I KFC VWPTL YFYWP 
ALVQ INL P * QLQGTE CVACMC 


8180 


22081 


A 


8240 


381 


180 


VLKLFFCSHRGPTMLPRLVSNSGLK* S S 
HLGLPNCWD*RCEPWHPASLCLLTSTFW 

T)DTT T /^TVTTtJV 


8181 


22082 


A 


8241 


388 


260 


NAI P F Y I TKCCNP S TLGG * GG * IT*GQE 
FENCLANMVTCPCLY 


8182 


22083 


A 


8242 


1 


145 


GFTILVRLVLNS * PLDPPALASQGAGIT 
GVSHCAQLIYVFQRSLQLLW 


8183 


22084 


A 


8243 


1 


173 


GLT VS QAGVQW CDHS S L * S Q TLGL KQ S S 
SLSLPSSWDYKCKPQHQADRFFLMEFFQ 
M 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
stcid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, KNHistidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8184 


22085 


A 


8244 


202 


327 


TLFITFIFF*R*GLAVLPRLVLNSWPQV 
I LLLW P PRVLGLHV 


8185 


22086 


A 


8245 


42 


289 


KE KRS 1 1 LF I FFCRDGVLLWLPRLDLNS 
WPKTIFPPWPPK*LALQAHTTIPRGGGS 
IILICIRDVKRERKQYAKSKNPEGET 


8186 


22087 


A 


8246 


592 


485 


NIVSQAGVQWHNLGSLQPLPLGLKWFSC 
LSLLSS*DYRHVPPCPANFCISVETRFC 
HVGQAGLELLASNNPPAPASQSTGITGV 
SPCTRPRVYVFLLWIIjANGPS*RAETLV 
NLATNQEDADSSIiAPGLGSGHCF 


8187 


22088 


A 


8247 


307 


12 


MCSPVFSPKQKNPLFLNFFKKEFLFFFP 
GGRQGGHFGFLAP PL PGLNKFL VLP FPG 
NGDTGKG P P ARGNF F FF F F FG I F SRD F V 
SPC*PGWSLIPDPR 


8188 


22089 


A 


8248 


392 


237 


H YGQAS LAPL T S GD L P AS AP Q S AE VTG V 
S HRAWPTVL I * S FS * KFNWV 


8189 


22090 


A 


8249 


153 


386 


FFPPGWPKPSLLEPPPPWLKGFPPPTLL 
GGGD * KR P PP P PGNFCF FKRNG VS PL WE 
GWF*TSNLGNPLPSPPKRVGIT 


8190 


22091 


A 


8250 


387 


166 


IETILDAIKNIHDSWEEVKISTLTRVWK 
N* I PTPMDNFEGFKTSLEEVTABVMGIA 
RELEVEPEHVIELLHDIS 


8191 


22092 


A 


8251 


405 


266 


LFAI VKR*NHPKCLLI YARINKMWHI S I 
GWNIISLISKGMKFSYMFLTWMNLEVIM 
LSEISHSQKVT*YMVPLA*GHLEKLNCL 
PIKQQTPIPSSSSP 


8192 


22093 


A 


8252 


379 


201 


PGANYACNPSTLGGRSGWIA*AQELETS 
LSNMTKPQPCSTPQCFLVNQLLLQNLYP 
YFP 


8193 


22094 


A 


8253 


78 


311 


LTL YFHTRKKKGLK* I F YF I * S I LSQKD 
L KVG TG PGAGAHT YNPRT WGG WKGLQAG 
KRIT*AQEFETSLDNTVKTCL 


8194 


22095 


A 


8254 


303 


81 


TPCAENFKLRKRFHHI ISLQRKMNYLIS 
FLYF**SHSLTILPRLVSNSWPQAILLP 
LPPKVPGLQVSATLPGL 


8195 


22096 


A 


8255 


2 


376 


MGYNLVCHLLRPLEKRSSIRVGVT*FSG 
CPPSPLPLARKGYSLTPCASQVRQCLAL 
LWLTLGGLHPLS CPHCPTS PS VMNPVPQ 
LEMQKS P VFCVSHAGS CRLELFLFGHLG 
TQSSLDSFFKKVI 


8196 


22097 


A 


8256 


70 


356 


LI I AYNLTYDLSWKMFYVYLRKI YTLLH 
WVECS I YTCWV* FVYSMVSWNR1DSPGK 
KKTN I YDQL I FNKGAKSTKLRKNS LFNK 
WGQDKISTYKK 


8197 


22098 


A 


8257 


403 


85 


PFCPWFPPVFIPPPLRKSGTPPGPGPEF 
PGPGFSNGGNPRPLKKSPKFSPGWGRGP 
LFPPSRGV*AGNFLGPGRWRGPWAKFVP 
LPSCLGHKARFFFPKKKRRKIK 


8198 


22099 


A 


8258 


249 


2 


YGEGDLGSYRPGVSKCMEFSPRRDIRIM 
P WTPGAVC YAYNLSTLGGRDGWI A* A* EF 
GT S LAY I VKPHL YTHTHTHTHTHTM Y 


8199 


22100 


A 


8259 


2 


189 


IQCVCVTKSHFCHPGWSAVAQSWLTAAS 
T S * A* VD F S HLS LLKTS WD * RRVT PRL VN 
LILFPS 


8200 


22101 


A 


8260 


453 


128 


FFFFFFL IETVSHDVP * AGLELLS S SHP 
LAS AFQ S AE I TG VS HHAQ P SS VS WL I HS 
S 


8201 


22102 


A 


8261 


420 


79 


KNPTPFFFFFFLIETVSHDVP*AGLELL 
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NO: of 
nucleotide 
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SEQ ID 
NO: of 
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sequence 


M 
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NO: in 
USSN 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A la nine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=OPhenylalanine } 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y—Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 














SSSHPIASAFQSAEITGVSHHAQPSSVS 
WLIHSS 


8202 


22103 


A 


8262 


1 


262 


NS RTERLGS WQ S HF CHKEME AHF VVQ S G 
VQWCDLGSLRPLPPGFK*FSCPILLSGW 
DYSQCRPCRTCSLIFMHSRSPSQPSEKK 
WTC 


8203 


22104 


A 


8263 


321 


100 


SATKNYLLSRQS F IGK*NNGLGAVAHAY 
NSSTSRGLGGQIA*A*EFETS *GNMMRP 
HCYRKYKNLARHSTYSPR 


8204 


22105 


A 


8264 


145 


275 


KPRDYSFFF*WKRDIAMLPRLASNSWTH 
AILPPWPPKVLRLQG 


8205 


22106 


A 


8265 


73 


351 


SLCYRSFARCDRPKKGQHFSKSYLQDAQ 
WAR TIiLS TI AQNItf S AEHHLKT TD W * GAV 
AHACNPSTPQGRGGRIA*VQEIETSQGN 
TGRPHLYKK 


8206 


22107 


A 


8266 


382 


232 


LVETGFCHVGQAGLKLLTS GD P PASAS Q 
SAG I TGVSHCAWL *T*YS*DFQ 


8207 


22108 


A 


8267 


397 


130 


X GQAGLKLLTSGDLPTSAS QTAGI TGMS 
HRAWPTFSLIAR*NASFQVGHFSPKYQV 
GTLAKS L KCHQ VR S D TLE FILCCRIIYE 
TLGKF 


8208 


22109 


A 


8268 


309 


97 


DTW YRTRRKDHCT I TETR IQSVRPLS PR 
*AITSRVTCTYTSR* PEANEEPQKITFR 
HCDLFLPTLTDQLTL 


8209 


22110 


A 


8269 


2 


224 


DSSGIPGSPTRPPTRPKKQKLPCDPTVS 
L.LG I C I YDQRQ * SVFQRD I CHSHMF ITA 
LFTIVNIWKQT*HPLMDE 


8210 


22111 


A 


8270 


374 


2 


TSWESARKPPPIHSHTL.KGFFFFLFMFF 
FF*NKICKDGSLTMLPRVVLNSWAEGII ) 
LPRSPKWGIQELATQRRASILLCNPTY 
ACGI FQHWVNVKL.LMKQCPFCPAAPTK 
TQNFFPRV 


8211 


22112 


A 


8271 


1 


379 


PTSASQVAGTTGTHHHIVLFFFFGIFEK 
KGVFPFSPGGF*TPKPKPPPGFGPKRGG 
KKGGGPPPPARGMGSGVFYPPPQKPFFV 
IiIGGQ I IiFKGRTLDFKKRAP P F P P FSIi F 
LPKHTPSFWIKWDP 


8212 


22113 


A 


8272 


80 


401 


AWGMRFCLLLYS CQYLISD I AHFS KAFF 
FWKRE P P PAPQ VKGQG PWL S * TOPWLQG 
*RGFSGLTPKGGGNYGPPPPPPIiIFFFIi 
RKNGFFPGWPGWSKTPDLRKPPC 


8213 


22114 


A 


8273 


405 


95 


PPKRFWGSSKGFPQREKKLGSPTFPKKK 
KDFIYSYLS KSLKKE * INRP SAAAHTCN 
PSTLGGQGGRIT*GQRFKISIiSNTGRPH 
LYEKIKNVKKLKIKKPNKL 


8214 


22115 


A 


8274 


167 


52 


DREQRPGAMAHTCN PS TLGG * GGW ITRF 
IiANMVKLRLY 


8215 


22116 


A 


8275 


276 


23 


KYVS SRPGTVAHS CKPS PLDGQGEWI A* 
VRKFKTILGNMMKPHLLFYKKLTFKNVK 
CI FKKYGATESRYVTKEKQKDRKKGTKE 


8216 


22117 


A 


8276 


1 


114 


FTMLARMVW I S * PRD P PASAS QSAGTTG 
VSHHARPGLS 


8217 


22118 


A 


8277 


2 


87 


TMLLRL FLiNS * AQAVILPWPPAVLGLQG 


8218 


22119 


A 


8278 


357 


347 


SEGLALYQLEAFTVCDLRERGFCRGRSL 
WPLITWAWRGGVFLLVQFQEVCCRLALG 
SLSPDPVLLPESGGPTSASQSAGI IGVR 
DRAQPNCVFEIGSYSVTQAGVQ*HNHGS 
L*P*P 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GJycine, H=Histidine, I=IsoIeucine, 
K— Lysine, L=Leucine, M=Methionine, 
N=Asparagine ? P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W—Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8219 


22120 


A 


8279 


371 


125 


FFPKKNEPPG*PPPKK*RKPGKFILAPS 
PPFFGGGEFYKLKRGVLKQKPGPPGVPK 
ILKTPPRLGKNKPISPPPGQKKQNFF 


8220 


22121 


A 


8280 


377 


63 


PPVFPPPWGGQGGGFP *ARGLRTPWPTR 
ENPVFF*NPKN*PGGGGPF*IPPPWPG* 
ALKFPLPPRGGVPLTKNIPPPPRFSHKR 
RVFFPKNLKKKKNPCLIESL 


8221 


22122 


A 


8281 


2 


284 


LPK* VKNFVHAKI CT * MFVAALFVI TS S 
WKQPRYYSSIEKWINKLWHISAMEYYSS 
LKR*ELSSHEKT*KNFKCGSLSEKSQSE 
KDTWCLIPVI 


8222 


22123 


A 


8282 


318 


40 


AP RL S LWPMWF I KRLD S S P GSR I T WT 
NCFENHLCVCVCDRFLLCQAREQWRVLG 
SPQSPSPKFKRFSCLSLPSSWDSRCKTP 
*PANFCGLC 


8223 


22124 


A 


8283 


30 


343 


CYQKASHATEKSFMR*RVNPCSKLCCCY 
KILPQPPQP S AVI TL I S Q QNKKKKKKKK 
KKKKKKKKKKKKKDGGGALKKKKIFSPR 
GGR IFFF FF WAKKNNLGGGL 


8224 


22125 


A 


8284 


163 


392 


FWL YNFS VI YSIS DPCWYKEQLCKS VF 
SFLSS*SIjLDEPNPNSPANSQAAQLYQE 
NKREYEKRVSAIVEQSWNDS 


8225 


22126 


A 


8285 


406 


63 


KKKLILAKPGWNLDSBKRAPPFFFFFF 
RHGVPLCCPGWSKTPEPK*SSCLSLWSS 
WDYRREPIiCLAYS PVLNI THLSSPNVKL 
CRDSQES YDPI IMMMVI I ITEGGMI CGV 
IM 


8226 


22127 


A 


8286 


1 


322 


TNTLLALLLFI 2TF*LPQLNGYVEKSTP 
YECGLDPISPARVPFSIKFFLGAITFLL 
FDLEIALLLPLP*AriQTTNLPIiIGMSAIi 
LLI I 1 LALRLAYE * LLKGLD * AE 


8227 


22128 


A 


8287 


395 


181 


HGSLHPGTPGLQQFSRLSLPSSWGYRHV 
PPCLANF* IFCRGGVSLCCLGSVEGLEC 
QARLPVLVRAELGLD 


8228 


22129 


A 


8288 


3 


246 


SSTLSFPSG*DYRHVPPCPANFFFFWIL 
EKTPFCFVPQGGFNLWA*TSPLNPPPWA 
S KGAKI TGGTPGMGLINIiANGRP P 


8229 


22130 


A 


8289 


138 


421 


LLSSVFFFFERDFCFVPQAGGQWGNLG* 
WNPWFRGLKNFPGLTPPRSWNNGGGPPP 
PLI FCFFLKKKRVS PRGRGRANFLDLGT 
PPPGPPKERE 


8230 


22131 


A 


8290 


141 


375 


RTFFFFFKTEGYFVAQAKGQGGDHGSLQ 
PQSPGLKGSSLLRRWDYGSMPP CPS I FF 
F * I F * KMGAS L CGPNNS KTGG 


8231 


22132 


A 


8291 


407 


209 


IGGPPGFSPPPVFKNRPPNLIFGPPKKK 
KNFPPPGGKNWFF*KGPPPFFFFFFFFF 
FFFFFYYTLj 


8232 


22133 


A 


8292 


86 


433 


P PXPNKL I EXGGAF * KRHFFP P PRGRVR 
FFFFGPQKKTPAPGQNPGGGGKPWGSPL 
FSPGEKNXPPTHPXGGPVLGAPPPPPGI 
VTjI * KKTGIjPLFVGRAGTNPNLGGPPAP 
PPKG 


8233 


22134 


A 


8293 


147 


342 


LVKFEYTVLPKNRYRLGWAHAYNSSSL 
GGQGGRIT*A*EFKTSLGNILRSHLYYK 
KRKEKKLN 


8234 


22135 


A 


8294 


6 


152 


CCGHAP PHPANF YS L * RRSIAMLPRLVS 
NSWTQAILLP* PLKVLALQV 


8235 


22136 


A 


8295 


18 


205 


I P TRP P TR PIP GRIiNHFTFTATRPG V Y Y 
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NO; of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO; in 
USSN 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=lso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S— Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GQCSEI* GANHS FMPIVLE L I PLKI FE I 
GPVFTL 


8236 


22137 


A 


8296 


1 


199 


YLYIYLS ILLS I YLS IYLS I YLASYLAT 
*LAI * KEI YYEE YVLVMTEAE I FHI LPS 
PSWRPRKVMV 


8237 


22338 


A 


8297 


1 


318 


PTRPPTRLLSRPIKIQIALHKFSLCPEY 
LQ* ADYLF* SLWIRPDAWPHACNLS IL 
GG*GGRIA*AQEFEISLGNIVRPCLYCI 
IKNKIKKKKPLQIIAFFLCFKD 


8238 


22139 


A 


8298 


416 


304 


GFLHVGQAGLKLL I SGHLPA* ASQNVGI 
TGMSHHARL 


8239 


22140 


A 


8299 


352 


31 


EMILKAAREKK* IPYNEPLICLTVDFLV 
ETLQARREWYDI FKVLKKKNKTKNPFYT 
KIEYLVKLSFKYEKVTKTLLDP*Q*LRD 
FINTRHANRNTSNRRKRP I I KHK 


8240 


22141 


A 


8300 


382 


266 


RWDFVMLPKLVSNS * AQAILHVGLPKCW 
DYRREPPHLA 


8241 


22142 


A 


8301 


2 


273 


PLTRGPSSASDALSSGGPYHPSECCFTY 
TTYKIPRQRIMDYYETNSQCSKPGIV*V 
VHTHHTGGRGSQQGLLEGSREWWWNGDP 
QRTSQV 


8242 


22143 


A 


8302 


77 


430 


GMGLSMHRTHFQYLAHIHPCVHVHTHTH 
THTHTHTHTHTLCM * HTE I HKS VRYGTE 
HAQD PFSI FS P YT PMCACTHTHTHTHTH 
THTHTHSVYVTHKVCEETCLSFFYPESG 
T I SHAQE I LKRCLENKTHLGVWCFTS LL 
SVANIVFFSYKRQVAAARGGLFFSLSQA 
E 


8243 


22144 


A 


8303 


137 


396 


S S VKTMNYLKP WHMVADRC * LFQCLPS 
KKKKKKKKKKKKKKGGPLKKKKI LTRGG 
GGI FFFFGAPKK I S PGRV* KKKKKKKKK 
KKKKGG PL KKKKNFNP GGGGN F F F FWG P 
KKNIAGARLKKPGGGKNPRSP 


8244 


22145 


A 


8304 


329 


144 


NGPPFFFFFF*DGVSLCCPGWTAVARSR 
LTFAVFFTLNCSWGIHSELQHWFQDYL 
KLKTF 


8245 


22146 


A 


8305 


404 


157 


RT YN P S TLRGRS GQ IT* GQD FD T S LGDR 
VGPHLRKRKRNYYSLVECFYRIKEKYPE 
SHKKAMKMLFQLKPDFLHI FQHIATN 


8246 


22147 


A 


8306 


394 


294 


S LCFLTP PRDFNLGAL I KNFYS PTPGWV 
S CVLS KGPPFFFFFFFKDGVLLS CPH* S 
QNPFVFALEWSGTPELK* S SHPGLPKAW 
DYTYEPPHQAQVELLRY*RSFPVLLLDG 
ICIFHSFFSPRGSLIP 


8247 


22148 


A 


8307 


280 


2 


WDSHVTFFTS VI TVMLFS I Q YVKQLEN 
I KSWPG I VARACNP S ALGGRDG* AQEFD 
S S PGNTAKPGLPQKNKTKQKTKNKVGGT 
LETRSWRL 


8248 


22149 


A 


8308 


136 


330 


MS YQRKS VNKYKSNlSrACEEVTTWARHSG 
SCLRLYNPSTLVGSGGKIP*TQEFETSL 
SNIARPYLY 


8249 


22150 


A 


8309 


3 


118 


I FWLVQMGFHHVGQAS LEL * PQ VI HPP 
QPPKVLGLPA 


8250 


22151 


A 


8310 


2 


366 


TRVDPRVRVRSTHRNLCLPGSSHASTSA 
S * VAGTTSACHHARLIFSFSFFFLGFSE 
KTGFNFFAQGGFEPLGEREPPPFGPPGF 
WGGGGGGTFKRPRRWKFQKD 1 1 P PLVSN 
RGDQPEPFF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine ? R=Arginine, S=Serine ? 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *— Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


8251 


22152 


A 


8311 


253 


446 


SKTFQARGGVTEMGFRHVGQTGLEILTS 
GDPPASACQAAGI TGVSHHARLR I * YFI 
RIFILMTC 


8252 


22153 


A 


8312 


430 


88 


G VGR I Q I S WS P WLPKPKNS PGS PPQ * VG 
APGGPPPPGPYFFFFFQKKGFPIjFSGRV 
*GSGSKNFFVFPPPKNWGTSPPPPPEGR 
GFFFFFFFLSRDGASPCCPPRPPKVLGL 
QV 


8253 


22154 


A 


8313 


400 


323 


HRLMPPHP IDFFLFFVETGS CYVAQGAC 
E LLG S GNL PGS AS * NAR I TD VNHHAQ P * 
KSSSGPGTQQC 


8254 


22155 


A 


8314 


4 


421 


GGRIARTQEAKVAVSQQCTTALQPGRHS 
KTFSQKTKKKKSPGGPPGPPPGRFFFFF 
KT * KKG P FLKKP P Q RGGD P RG KKKRGGG 
LWKKPGGXLGGPTKKKTRGGGANPPPLW 
GKGGGP KGAPRAFP FTKKKP P P S GRNQ 


8255 


22156 


A 


8315 


213 


32 


RCPGTVGHGCHPSPLGGQRGRIP*GQEF 
ETSLANMVKPYFFLNS KKKTKNHPTKKN 
SQAW 


8256 


22157 


A 


8316 


422 


153 


AATLYTHKTPHCIYFFLDRNLLCCPGWS 
QTPGLK* AFCLSLP S S WDYRHKP PRP VT 
SPIFRIKSYDVIPPPYLE*NPMMLYLPS 
YCHHI 


8257 


22158 


A 


8317 


1 


284 


ERQDWESRLEAMECAFHLEKSVNQSLLE 
LHQLAMEKGDPQLCDFLESHFLNQQVKA 
I KKLGDYLSNLCKT * APEAGLAE YLFDK 
LTLGGSEEDT 


8258 


22159 


A 


8318 


187 


37 


TRDFFWPGWAHA*NCSTLGGQGRRIA* 
AQKFESNLGNRVSHCLYKKFKN 


8259 


22160 


A 


8319 


2 


250 


KYEQSLQETODCVKRPNLRIIGVPEEEE 
KSKSLENI FGGI I EENFPGLARDLDIQ I 
REAQRTPEKS IAKRS S S RH I VIRLS K * E 
VWDCVKRPNLRI I GVPEEEE KSKSLENI 
FGGI I EENFPGLARDLDIQ I RE AQRTPE 
KSIAKRSSSRHIVIRLSK 


8260 


22161 


A 


8320 


371 


196 


NNFSILFFFFETRSH*VAQAGLVLLGSS 
NPPCSASRVAGVTGVHQRYQQDDFKNFR 
HN 


8261 


22162 


A 


8321 


3 


235 


FISVSFKYSKHI PNNMHLKFL VL FLNFF 
FRDGFWLRSG*SWAILRCGHSSLQPLT 
GLKQS S HLS FQGS WD YRRAPH 


8262 


22163 


A 


8322 


1 


395 


LEVPLLNTSLLLASGVSIT*AHHRLIES 
NRNQI IQALLITILLGLYFTLLQASEYF 
ESPFTISDGIYGSTFFVATGFHGLHGII 
GS TFLT I CF I RQL I FHFTSKHHFGFEAA 
A* YWHF VDED* LFLYVS I Y 


8263 


22164 


A 


8323 


322 


83 


MHQS FL VAKVKKGKHGRWFYS CLM I KS A 
P VLS VTHNTLLC WPGAWHACNP S TLGG 
QGRWMA*AQEFETSLDNVGKPHP 


8264 


22165 


A 


8324 


12 


249 


GFFLPPPPGRGEKRGPPPPPLFFWFFFK 
KGEF PLGGGNFLTLE P PRLAP P KGGK* G 
GDPPPPGFFFFLKKKPKPLLKG 


8265 


22166 


A 


8325 


139 


335 


EN I LWAATWLD CHP S TLGG * GGR I T * G 
QEFKTNLANIERPHLFFFFFLKRNFFFA 
PRVEGRGPV 


8266 


22167 


A 


8326 


1 


246 , 


FRRGLAVAPRVLL C YPGWRAM VQ S * LTA 
ASNSWAQAIHLSLPKCWNYKHTPPYPTC 
*NSACWDESSFPLLVSAMEVCGQPHS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^GIutamic Acid, F— Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8267 


22168 


A 


8327 


422 


107 


IYYVRQQRKKRGLFFPFFRFPLPPGGGQ 
WGHF WFLQP P PRG V* RFF FLNLPRNR P W 
DFKGLPPPPAYFAFFLKKGVFPPGQGDF 
* ILAPKGPPTSTSQKGWGFKD 


8268 


22169 


A 


8328 


I 


161 


FRGIRWVCHVLFFYLSFFLRQGLAVLPR 
LILNS WAQATLLPQLPE * LGLQTSS 


8269 


22170 


A 


8329 


422 


103 


PPYSPPFGGPGGGVPKVRESGPPGPPQG 
NPFFFKKPKNNPGGGGGPLIPPSSGG*A 
KGMGL S PR P RL P L S * NWP P P P P PGGQNK 
LPFPKKKKKERKKEKFNFEHRG 


8270 


22171 


A 


8330 


46 


422 


STPFPLiASQSAGITGVSHRARRSSSFQC 
TQIPLFPETLPNLGFGSRHGRGFSSQVL 
PAQSSSFDLDLISVLGY*MLSHISLGPN 
RALGFIPPHLSKWENGSTLRSIVLSQIE 
GLSSTLSFPHPVS 


8271 


22172 


A 


8331 


429 


112 


FSHLGSSPRQGFTMLARLWNS*PCDPP 
TSASQSAGITGVSHHAQPHAVFYIRLPD 
SQVLCHQMP YWPTAHYPHRW I FSLVLQK 
NLS VPPPLWPIQSQPKAS ILLM 


8272 


22173 


A 


8332 


291 


19 


KTPFPPKI*KFPPNFLKFTPGCGGPPPP 
FLKKFVSPGNPPPKKIFFFFFPPPPPFF 
FFFFFLRRGS LIS I AQDGI QWHYHGS LQ 
PQLPGLR 


8273 


22174 


A 


8333 


307 


123 


KYFRGGGGGGR* FPPLGGVKQKKRGNPG 
GGGSKKPKLGPSPPTRGKKKNFFSPKKK 
KKKKK 


8274 


22175 


A 


8334 


3 


169 


HASADMCHQAQLLLLLLVETGLCHDGRA 
GFGS ±SQEIHPPRPPKVLGLQHFGLIY 


8275 


22176 


A 


8335 


1 


208 


LCVCVCVCVCVCVCVLFG * WRKGLALS P 
RGE CSGTNMAHCS LNFSGS SNP S VS P SH 
VAETTGVRPILG 


8276 


22177 


A 


8336 


1 


352 


ILFYFIFYYFSFCFEMGSRSVSQAGVQC 
PDLSSLQPLPPRFKRFFCLTFPSSRDYR 
CVPPHLAKFCI IRRR*AAIGQAGIELLT 
LSDPPASAS*SARNTGVSHCSGPSFINF 
LNWQI 


8277 


22178 


A 


8337 


420 


272 


KAPCQFSGEKTVFPKHCWNSWVPFRKKQ 
QQKKNLCPCLVLYTSINSQ*TMGLNIKA 
RTW IS* KYKEDP* TGKK* INKS CPWTG 
KKKKFLARTQKTIAIWDTLINWTSPKLK 
MSTKQRHHLKMNR* PKWLLFFVS 


8278 


22179 


A 


8338 


427 


202 


LIIYFCHQSVQKKGVLPLLQEREGWAEA 
SLKI SNI SLSHTHTHTHACAHTHTHTHT 
HTSS*NGIKTVVTTrKSFA 


8279 


22180 


A 


8339 


2 


164 


TPGHGISLCRPGWSAVAPSRLTATSASW 
VQTILLPWPPE*LGLQESRLLRDHLC 


8280 


22181 


A 


8340 


422 


1 


PPGPPGGKTKRAPPPSFFKGPNPLLKGK 
GPGAKPP P KNG VFFFPPKKVKGP QKNFG 
GTQI S QPNKRGA* KFTPFFL * KPPQKKK 
GPPPGEKIFKGPKF*KFPSPGFFFFFLK 
QGLTLSPRLERSGANTAHCSLKLLSSSD 


8281 


22182 


A 


8341 


1 


456 


RTRGAVSQGHAIALQPGQQSETPSQKKK 
KKNPAPGGKGKIGGTAKKKGRNFLDANI 
LGPNSRPGGAFFGG*TAPPGKKRGSPGP 
IFGGGKGGPKKKIIiSF*KAGGGAKFFPS 
PVGTPFFRGF*KGKILLKKKKFSKPGGG 
NPENLSPTNFFF 


8282 


22183 


A 


8342 


407 


2 


RQFII*GAPGVFPPPGF*KPPPKI IFWGP 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine 7 P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W= Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














KKKKTFFPPPRGEKFFSFKGAPPLFFFFF 
FFFFFFSPFFFGALFFFFFTILFKTKFG 
VFSCKGIPPKFFKRSRPPNGAGRLPLLG 
VGPLPGVPQGGEAERKI SQGS FQ 


8283 


22184 


A 


8343 


441 


1 


PPPYILGGPRVFSPPPFFKTPPQIFFLG 
PQKKKNSPPPPGKK:iFFF*RAPPPFFFF 
FFFFFFFPPFFFGALFFFFFT ILFKTKF 
WISWKGIQPKLFKRSRPPNGGGRLPLL 
GGG P P P G VP RGGE AERKF S RG S FQRTGG 
SAPEFRA 


8284 


22185 


A 


8344 


466 


323 


FGVAGTQNPGRNPRVPSFRGGPPPGVPQ 
GRNPPPPWFFPLNPPKHFLTP*VFPFLE 
PP I E WGPGP P * P S YGFS S P VARK 


8285 


22186 


A 


8345 


657 


347 


LFFWFFLRQSFTLVAQAGVQ* CDLGSLQ 
PPPPGFK*FSRLNLPSSLDDRCPAPCPA 
NFVLLVETGFHHVGQAGLELLTSGDPPA 
SASQSAGI TGVS I SAQPAQ 


8286 


22187 


A 


8346 


3 


212 


DCSRGPPPHLANFGLLEKTGSNHVDQAG 
LKPLSLRDPPASAS*SPGITGGSHRAQF 
* F I FKRGMAPKGF 


8287 


22188 


A 


8347 


9 


255 


IPGPTHPSDFFLYEIDLPFKN* KGFPPP 
PRGGGKNPPFPPPPFFFS*PPWKKKITF 
QTPLTGVFGAPKNTFNFWGGFKKIGF 


8288 


22189 


A 


8348 


79 


312 


NAAGFEVGISKA*ALETLQIITREGHSN 
KPR YAVT FE SHEMGT VLHL VE QEHT VG Y 
ITASCSALKC* IESGGFLNDYI 


8289 


22190 


A 


8349 


1 


369 


FFFFLKRDRVSPLSSRVKYKWHDYSSLQ 
LQTPGLQ*SSCLRFLSSQDYRHIPPCPT 
NF * KCFVEMGS CC VTQGWS QTPGAQAVL 
PPWASQRHWGLQGVRPLCSLLLSNCPHS 
RLWSWLFSGC 


8290 


22191 


A 


8350 


434 


324 


LP I LS P VWWHMP WPGTRE AE A* EQ * S L 
EPGRRRLQ 


8291, 


22192 


A 


8351 


3 


253 


YMCSKFFFLECSGAISAHCHLELPGSSD 
PPPPT*NF*VIGTTGTCHHEWLLFELFG 
KMKPHYIAQVGFKLL.ERNFLKTKKVFP 


8292 


22193 


A 


8352 


3 


1431 


SPRGGIPSLKQVYSSLTTNSRPSQHLF* 
EEALQRAREEEEKRKEITSHFQSTLTDI 
QGQ I E Q Q S ERNMKL. CQENT E LAE KLKS I 
IDQYELREEHLDKI FKHRELQQKLVDAK 
LEQAQEMMKEAEERHKRE KE YLLNQAAE 
W KLQ AKVL KE Q E T VLQAQL TL YS GRFE E 
FQS TLTKSNE VFATFKQEMDKTTKKMKK 
LEKDTATWKARFENCNKALLDM I EEKAL 
RAKE YE C F VMK I GRLENL CRALQEERNE 
LHKKIRDAEISEKDDQSQHNSDEEPESN 
VSVDQE IDAEEVNSVQTAVKNLATAFMI 
IHHPESTPHQSKETQPEIGSSQESADAA 
LKEPEQPPLIPSRDSESPLPPLTPQAEA 
EGGSDAEPPSKASNSPAGLGAETQCEGL 
PVGAQADQASWKP E AEASGQAPQAPTE A 
S LQ KME AD VPAPACAAEEHVAAMVP ACE 
PSRQPPRAAAEELPVGASAGPQPRNVA 


8293 


22194 


A 


8353 


376 


65 


IHCGDHVQEVPKLKVCALHMSSWACSQI 
PKAGDKILTFDQLTLDTLKGCGTILLSG 
PHKGQEVYWHFSKALGTQHSHTKPCVHS 
RD QKF KH I RG * WAS QG YKN 


8294 


22195 


A 


8354 


459 


85 


' LEEVESLERVLEKKMIESEELKSKQTRF 
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NO: of 
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M 
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Predicted 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D~Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine 7 S=Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LEE I KNQDKLNKSLKEEAMLQKQS CEEL 
KSDLNTKWELLKQKTIELTRACQKQYEL 
E*ELAFYKIDAKFEPLNYYPSEYABIDK 
APDESPYIGKSRY 


8295 


22196 


A 


8355 


231 


158 


PFF I F YL YQSND I ARGLERGLE PEKVI G 
ATDSCGDLMFLMKW*ESVAPMTFSGSRP 
LSKPRAISLLWYK 


8296 


22197 


A 


8356 


2 


164 


I HRGVENQMAGEQGWDKDVS VYWVGAVA 
HACNPNTLGGQGRWI T*GQBFETNLP 


8297 


22198 


A 


8357 


23 


291 


CTHTRAQAHAHTHHLFYYFLYI FNFFEE 
TGSH* IVQACLELPASSDPPASFSQSAG 
ITGMSHHAWPTSCILKVKMALCSINCSA 
ACFLY 


8298 


22199 


A 


8358 


1 


307 


FFFFEIESCSFAQDGVQWCDLGSLQPPP 
PGFKQFS CLSLLS S WDYRHVPPRLANFC 
I FSRDGFHHVGQAGLiQLLTSND PPT * AS 
Q SAG I TGVS PRAQ P C P AF 


8299 


22200 


A 


8359 


75 


266 


FVKTRNLFLTVLEAGKSNIKARAG*VSG 
EEHGLCLQDGALLPLEGRNAVSSHPRGQ 
JNTMMRAKC 


8300 


22201 


A 


8360 


1 


295 


NTWVFCFVLLFRDRVSLCCSVRSTVASS 
*LIWMNLSFPSS*AYRHVSSHLDFFFF 
FKIQR I L I LPRL I SNFWPQE I LLWLPQG 
LGLQPRPQVLVGNS 


8301 


2.2202 


A 


8361 


376 


293 


MGFHH VS QAGL KL L TS GDL PAS AS QS AR 
I TGAGHHAQPDF * HSGDS WQAP PE PRDQ 
G 


8302 


22203 


A 


8362 


375 


142 


PQEEFLRFATDVGEYRAVTELERPVSES 
WNTQKDLL * QKRGRVSTYCRHTYGVGES 
FS VQRRGEHVGGRGLGPCELE I 


8303 


22204 


A 


8363 


36 


363 


Q LS FNP L I GKKVKLRT VF I GKQ FL FL LG 
GGVRE AGAHSGAQAT VRWHNL S SHAKRP 
S CLS LPS CDHRHLLPHPANFC I FWTD* I 
PPCRSGWVQTPELKQFTHLGLPKR 


8304 


22205 


A 


8364 


455 


235 


GAQARAPPRGIIFFFFW*RKGFCMLPRP 
VSKFWG*WPLPWPPKRVGLDSWAPPPR 
DFIFFSFFVKKRVCILW 


8305 


22206 


A 


8365 


432 


175 


STDQVS VAMGPS * PGAGYNLL VCRWLR P 
LKKCS I RMGVS * F S RYCLS WL P LD KKGN 
S LTP CAS RVRRCT VMLRLMLRGLH P LS D 
KP 


8306 


22207 


A 


8366 


383 


83 


RRLIVFICYFSKK*GLPLLPRLVSNFWP 
QVXLVPWPPKVLGL *ALKKL WLDMVTHA 
YNPSTLGGQGTRIT 


8307 


22208 


A 


8367 


27 


304 


FFGFILTLKKSSN*CTLRFCFBSIGKVR 
IHTLGNILSRRGYGERCSLPHCLWESIC 
LPVQSN*ALHKKVQMHIPFDQAIPSRGI 
LPTDTGHK 


8308 


22209 


A 


8368 


165 


204 


QNI P I AVSFFFFFLKRVS FCPPG* RAGG 
QSLFIGALTFQV*GILIPYPPGNWELRG 
PPPNLTNFF* FLNLRFLLGPINLHYVHD 
KTS PLL YLFFFFF 


8309 


22210 


A 


8369 


400 


201 


KERPLPFRLIGSPFQGTWPRFFSKGEVK 
NSNFSKY* PPPFINPAPRVCFEGIEVIT 
KFFQVRPFFP 


8310 


22211 


A 


8370 


21 


408 


LIPLSTYVCVRVCECVCVCVCVNI*GYA 
CNT LL I ACN I LQD VP FT S P TKD K I F S GD 
FLTLVGKK 
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ing to first 
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Predict- 
ed end 
nucle- 
otide 
location 
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peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G==Glycine, H=Histidine, I=IsoIeucine, 
K~=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q = GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, "—Stop codon, /=possible 
nucleotide deletion, \= possible 
nucleotide insertion 




22212 


A 


8371 


1 


94 


NrLGHEN*BNSGFGGC&ilPRSCHCTPTW 
ATE 


8312 


22213 


A 


8372 


546 


972 


PIHSLYGR*PSFCLGLALFKTLQVMHFL 
KDSVRNLKSSNLKVGLNFRSPQ IQAT I S 
, PPLQPKTKTYEEDPKSKPKKEEKNME VTT 
MKEWILRYAEQQNEEEKNENSKSLEEEE 
KFD PNER YLHL AAKLLiDAKE QAAT F KLE 
KK 


8313 


22214 


A 


8373 


502 


178 


QHFWRLRQKD S FALGG* KQPGQKGENP F 
FQKKAKNI LG * WDQ PWP TTQ KGGGGE S 
L* PGRTKS PP CTSGWGKKQNP VLKKKKN 
PQKTNCIPLCIYHIFLINRKTNTV 


8314 


22215 


A 


8374 


362 


112 


YRMS VNPQVRAKRVSQRTNTP PNFFF * D 
RVLLCRPG*STWQSQLTVAS I SWAKAI 
LLPPQPRQAKYFLRLVKCSSEKGERIE 


8315 


22216 


A 


8375 


411 


21 


VLGIYPTLVSPSPGLKMTRVKLPSGSKK 
VISSSNRPVFGVVAAGG* FDKP ILKAGR 
ASSKYKAKRNCWPRVRGVAMNPVEHP FG 
GGI LQR IGKPS AI R I YAPAGRKVGL I AV 
RRS GRLRGANT VQE KEN 


8316 


22217 


A 


8376 


450 


108 


AEGGRGPRK* * LRS CRPVDGSAGQS PGP 
LiRTESIiETPGCSPQGWQELEPPPRRPQ 
WWPGPPGHRDRRQHRRPGARSRQQGPES 
SH*QRSGESRPPAVPRPPCPLPWKKGVL 

TP 


8317 


22218 


A 


8377 


25 


306 


EQQPPPVAVHQSGPPGAPPEASPSHPGS 
S EKPQ P VP PPRAPLVPQP KP * LS PPHHP 
AAGLIiAVHAIiAGGPGTVQRGIiPGRAARA 
Vb VKV FAARL 


8318 


22219 


A 


8378 


376 


149 


PAHCSLL I PGFRGS S CLSLPCNWYPGRA 
P PRVGNFFFFF * EME S CYVAMLP SLVLT 
SWAIAILPVLPPKLPGLQA 


8319 


22220 


A 


8379 


98 


374 


S VPP P VGT YEGDAAP TGGGGL PALGR * G 
AAGGHVQQGHPGPHRGHAAPKP S AQGGP 
GHSRECLEPSGGAAGGEREPPRAGGGQS 
WMRAPSMT 


8320 


22221 


A 


8380 


2 


807 


RPQCCVRAPRAGAVSGVPKLWSVP*SGR 
SAPHPTRS * VRSWSRWDRTQS WQTSRTR 
RRSES KVRVSYQAWTASPPPAASASARP 
A*RTSSRSYSSSSTCCSWLWPSSWSTGP 
SQTFVRNSSTLSCLCLTRKWIAMMPQVIi 

VT*IAPPRGSTTRTPSPIRL*NLP*LSR 

(** DfTV* TfCf 1 OTaTCC C Q C 7\ * f T r Dt7\rD r PC! 7A T3T T 
Kjf \sti\.^ iVbUbWoboobA IJKv VxCloAirJji 

TSSSLLSRSSCKAQTG*ASCRPVRVPIP 
AGSSLGASAPGSRCHW 


8321 


22222 


A 


8381 


104 


327 


CIAFPEMLGIKYEVGKCPSGLKSLKILK 
NTHTHTHTHTHTHTHTHTHTHALYLIRE 


8322 


22223 


A 


8382 


400 


270 


KPKLEVRASGQVQWLSPIIPALEEAKAG 


8323 


22224 


A 


8383 


178 


588 


ADIjLS GAP GP PGGS QDNANS SEHS RACA 
PAAPGP * WLQPWLCP * WTQS PLEARGS * 
NPGVSHSSRIPGIPGAAGTSRPSGALPP 
QPAGHGGSSGELHVARGCACGRPLWSHR 
PGCGQLLRVTAADSHRCASRGAAPM 


8324 


22225 


A 


8384 


423 


78 


QGQEIAELTGVTVESPEEPQRLLVS*GG 
TVPCLACS LHRTL * TVGHGS AS S FIiAQG 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HHHistidine, I=IsoIeucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tnreonme, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *-Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DME PLGDRGLQGTEQItH * GGDRAS LQD I 
C FLL P VLS LLC P RS LDLHRHR VGVTG F T 
ED 


8325 


22226 


A 


8385 


434 


176 


FADDSRFDFLLLV*CAMLMLIREQLLEG 
HFTVNMRLMQDYPITDVCQILQKAKELQ 
DSK* PGGKRPMFGREASRPCAVAPGTHR 
NL 


8326 


22227 


A 


8386 


2 


305 


RLWEDPPGRNTPPGAR*PGPCGEGGHII 
REEQACGWPRGHSGPWRHPQAGPLQLGQ 
WRGKGAGQGARRSPQGSLESRPPTLSAA 
PPAIHICSVGGVRLHCY 


8327 


22228 


A 


8387 


331 


91 


EMRSHSVTQAGVQWRDYGSLQTRPPGLS 
FNVNWDWVGL*GDPS SPPMLLPS S * ARP 
GGGWGPGGEGHVS KEAAGRLQGPP 


8328 


22229 


A 


8388 


2 


250 


IHVRLWSQLFRRLRWEDLLS TGGGS CSE 
P * S CHCTVTWAFFLES CLRCAHWYVTL 
LTALLG SL P CAMP RQTAVS L PQAQLV 


8329 


22230 


A 


8389 


388 


176 


YWDS Q * CMPS FAVP VESACI CAIGHNFS 
KSVNSVIAVCVDGSSHKYVFTHDGRCHR 
E AFDVYRD I CDDDA 


8330 


22231 


A 


8390 


445 


330 


PPRFTPFSCLSLRSSWDYR* PPPRPDNF 
LSLYSVLHET 


8331 


22232 


A 


8391 


559 


142 


LRTL P VTLGKS LALS VPS F P Y VG GNW I T 
NRE P S AKGRGS G PNRG PGHAAS C P ARTH 
SLRHSPAGGVMLTQPRPILILPETQHPS 
ESAVARTD I S KARRWLHSRS P WPH* DE T 
SAGAERPG I RC Y I GHPAT 


8332 


22233 


A 


8392 


104 


400 


YYEIFKKEMYSYYHILHFFFFFSFFFFE 
TKPNS VNGPEGKGP I LG * L KP WLLG * KH 
FCLTLPSGGNYRHVLPPPVNGFFFFFFF 
KKKKVFPFVAPRGV 


8333 


22234 


A 


8393 


408 


169 


DQVW I KDWNIGSLRPR* KGPQTI ILTTP 
TAVK I E GI PAW I QHSQ VKPAAPE TWE VR 
PSLDNPCKVTLKKMTSPAPVTLRS 


8334 


22235. 


A 


8394 


1 


269 


YCCPLFSSKALTQENSPYSSFRLVNPPG 
LSLHPEGEGGKWINERGRELGPSAGPLL 
LFLHFAEAGRRQPPDWADSEADLQQVRH 
KLGQP 


8335 


22236 


A 


8395 


2 


204 


KDCKVNKEVERVLREFHQAGKP IGCDAS 
TSLPAPAPWWSWNAALCCPGLS CVWCWQ 
GBSK 1 CVGEHQ 


8336 


22237 


A 


8396 


3 


285 


KVTVKYDRKELRKRLNLBEWI LEQLTRL 
YD CQEEE I PELE I DVDELLDMESDDARA 
ARVKELLVDCYKPTEAFISGLLDKIRGM 
QKLSTPQKK 


8337 


22238 


A 


8397 


1 


192 


ETGFHHAGQDGLELLTSGDCPSLASQSA 
RITGVSHCTRAQLLTEQEYFKJSrQNHLPS 
APFYKRAF 


8338 


22239 


A 


8398 


8 


241 


LNPFLYAFLDPRFRQACTSMLCCGQSRS 
SGTSHS IRGEKSAS YSSGHSQGPGPNMG 


8339 


22240 


A 


8399 


2 


240 


S CLNP FLYAFLE PRFRQACTSML CCGQS 
RS AGTLHS SRGE KS AS YS SGHS QGPGPN 
MGKGGEQMHEKS IPYSQETLWD 


8340 


22241 


A 


8400 


10 


272 


CNWQNAYLTWRCAQDCEDYFAERLYRS 
MKGAGTDEETLIRI tVTRAEVDLLGIKP 
KFHQKYQKSLSDMVRSDTCGDFRKLLVA 
LLH 
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SEQ IB 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K==Lysine, jL==Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Serine, 
I— l nreomne, v™vaiine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /=possible 
nucleotide deletion, V— possible 
nucleotide insertion 


8341 


22242 


A 


8401 


47 


248 


TRG I FFFFLNTRSHS VAQAKGLGHDHS S 

XjJa.it ^ Jr X^J^JjJX^orx^XJoXjXJO JL WW X VJVJj_iJ_iirJrt 

SALWKAEAAKS 


8342 


22243 


A 


8403 


497 


303 


TVEASVSYLLYVAMVMQLPWGKSPASGA 
LLPQALLY 


8343 


22244 


A 


8404 


3 


235 


LTLIAHCRMMAAAHMGGPPMMPMMGPNT 
PGMMPVGPAPGMRPPLGGHMPMIPGYPM 
MRSPARLMSyiVPSQPRMTRPDR 


8344 


22245 


A 


8405 


3 


246 


LVFSVDVGLLAHTRQRGLKRSSGMSSLL 
GKIGAKKQKMSTLEKSKLDWESFKEEEG 
IGEELAIHNRGKEGWIKGAQPLGQ 


8345 


22246 


A 


8406 


1739 


804 


WEPDVSGSAAWLASAAFGMATFSGPAGP 
I LSLNPQEDVE FQKE VAQVRKRI TQRKK 
QEQLTPG WYVRHL PNLLDETQ I FS YFS 
QFGTVTRFRLSRS KRTGNSKGYAFVE FE 
SEDVAKIVAETMNNYLFGERLLECHFMP 
PEKVHKELFKDWNI P FKQ P S YPS VKR YN 
RNRTLTQKLRMEERFKKKERLLRKKLAK 
KGIDYDFPSLILQKTESISKTNRQTSTK 
GQVLRKKKKKVSGTLDTPEKTVDSQGPT 

FKQPISCVKEEIQETQTPTHSRKKRRRS 
SMQ . 


oj4o 


2.ZZ*\ 1 


A 


OA AT 


2 


241 


E AGL S Q SHDD L SNATATP S VRMKAGS FS 

JXJtCXj J. X £ I\Ox\.r x\J-MN ^JoJ Jr o Jr Ju 


8347 


22248 


A 


8408 


285 


186 


KQGLTSLPRLVSNSWAQAFLLLWPPKVL 
RLQA 


Q1AQ 


2224y 




o40y 


1 


246 


TDXJ"C , "NT"D T7 1 IT 1 T r"\\7T7\T\T PTl/VTlTT HT PHti'D'C 1 

KJriJl JN K. Jtli JL Jjy V ± AU Xj L. JL K X JJ 1 ij L. J. DJi tLh 

MNGVEYIWKGPCFCLTTFTLYLKGDIFP 

K.Jr Jro^V^XJlNijVVX V XjXy irl v i X ir V J.rii_iX. 1 


8349 


22250 


A 


8410 


1 


102 


AGS YTLHI IKGDDGTRGVTGRFTFTLHR 
IQTSEE 


8350 


22251 


A 


8411 


877 


647 


FFFETESHSVTQAGVKWGDLGSLQSLAP 

(T)Zk r VTPO (~*T ATT DOODn VD C T) nnnT nUTCr^m 

(jFis.is.X 1 bCijNJbir'ofaKUxKbJr'FFFJ-iANrCl 

FSKNGVSPCWPGRSSTPDRR 


OJJ 1 






QA 1 O 


1 




T3MT Mm*"! 1 ! ."O TT.Tr 1 dt<T W D VTVTO T/sT 7\ ATA T'tJG 
K.l v iJ_iiN J-Jr».XiJK.lJJJ- vjtVjJN r f JM^^WAA.±i\lrio 

FEFAQFDNFLEEATR 


8352 


22253 


A 


8413 


97 


401 


CNPENGIiTAS ALGRRM CLATCKAPR TL P 
DSGDTASCRFPAVPRPPHSRRSSGSGHL 

TDfiDDXP r 1 "D2iT.Dr27.T7TA7CTsTr>Dr i D 1 D , T 1 Cr , VT "CD 
X-'^Ji<.ir±<.L. Jr^-KXJ f VjXjxSWoIn irir^jir ir X otj X Xjx? ±r 

TFSTPAAHSHQKTLLGI 






A 


5414 


"3/1 Q 


Mo 


V OHUjr J. X X^^njZrJKX 1 OXJXIX ±U X O WJxXXjJ.X' 

Y I NHD FKLE RE VWICRLHDEG I IRLYQRL 


8354 


22255 


A 


8415 


714 


410 


LGVFRSALHGSLWLLLRSFPQKSPMPLA 
LLLFLQCNTAYQCLLIADQHCRTRKYFL 

^J.if\oljJ- r^VonvW V XuXJoOrl/xiNyXjyjN x HXM 

YLLPAGYSLEEQRILDW 


8355 


22256 


A 


8416 


2 


102 


GHDPQDRLl^AQDSEHNHSDRNWQRWRPW 
PKDSY 


8356 


22257 


A 


8417 


1 


309 


F YFENALSKSNKP IHTI ILNPHVHLVGD 
DAACIAYIRLTQYMDGSGMPKTMQSEET 
RVWHRRDGKWQNVHFHRSGSPTVPIKPP 
CI PNGKENFSGGTSLWQNI 


8357 


22258 


A 


8418 


3 


242 


ARALTNAASHVDDMPNALSALNDLHAHK 
LRL.DPWLKLLSHCLLVTLVDHLPDEFT 
PVEHACIiDiWLASVSTVLTFKYR 
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SEQ ID 
NO: of 
nucleotide 


SEQ ID 
NO: of 
peptide 


M 

eth 

od 


SEQID 
NO: in 
USSN 


Predicted 
beginning 
nucleotide 


Predict- 
ed end 
nucle- 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Th reon ine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8358 


22259 


A 


8419 


711 


296 


NVANS DGL IAS L WKE YGKADARWVYFD P 
TIVSVEILTVALDGSLALFLIYAIVKEK 
YYRHFLQ I TLCVCELYGCWMTFLPEWLT 
R S PNLNT S NWL Y C WL YL F F FNGVWVL I P 
GLLLWQSWLELKKMHQKETSSVKKFQ 


8359 


22260 


A 


8420 


3 


97 


FWSSLCNGLIAAQLLFYWNAKPPHKQK 
KAQ 


8360 


22261 


A 


8421 


1 


279 


GCGIADLAMSAIFNFQSLLTDILLLICT 
CAYI PSLAPNLLDRNKTGLLGI FWKCAT 
I VERKS P YVAVCCI LMAFS ILL I QKLVK 
MPQCICHNI 


8361 


22262 


A 


8422 


1 


228 


RHEVF I ELNHI KKCNTVRGVFVLEEFVP 
E I KEWSHKYKTPMAHE I C YS VLCLFS Y 
VAAVHS S EEDLRTP PRPVS S 


8362 


22263 


A 


8423 


3 


182 


SEDTGEEQWTAEFINRGEYE IDIAGYR 
FQAKAKLYPVASLFTQKRRKDDMELSDL 
HGK 


8363 


22264 


A 


8424 


1 


70 


VQFVFDAVTDVI IKNNLKDCGLF 


8364 


22265 


A 


8425 


2 


151 


ETTAS S CTPASLESRRCCAP CRMPRTGF 
FGSSPLWRPSGSRSLKPGFQQ 


8365 


22266 


A 


8426 


1 


187 


RGRVGPGGERLVPGVPGAEAQQ PAGDGV 
RAGP LQARP PAP VGVS QGRCQAAGAAAG 
PPRPDG 


8366 


22267 


A 


8427 


520 


423 


GKYQLQSQENFEAFMKAIGECWTERQSC 
GSHQ 


8367 


22268 


A 


8428 


1 


627 


GTSGTRGVTGYFTFTLYLETPKPS ISSS 
NLNPRE AME TV ILTCDPETPDTS YQWWM 
NGQSLPMTHRFQLSETNRTLFLFGVTKY 
TAGP YE C E I RNS G S AS RSD P VTLNLLHG 
PDLPRIHPSYTNYRSGDNLYLSCFANSN 
PPAQYSWTINGKFQQSGQNLFIPQITTK 
HS GLYVCS VRNS ATGE E SSTS LTVKVSA 
STRIGLLPLLNPT 


8368 


22269 


A 


8429 


3 


390 


I LGCNILRVE YSLVI CVSVPGS KKVILD 
LPLVIGSRSGLSSRTSSMASRTSSEMSW 
VD LNI P D TP E AP P C YMD VT PE DHRLE S P 
TTPLLDDMDGSQDS P I FMYAPEFKFMPP 
PTYTEVDPGILNNNVQ 


8369 


22270 


A 


8430 


3 


208 


NGTHVI ILLCLKTCGTWNVANDMI VAS 
NLGTGVPNQTPVSSGDLLIRSNGLLIPG 
TCEITRLYTISE 


8370 


22271 


A 


8431 


3 


563 


LPTSRVDPRVRLDRMKKDQEEEEDQGPP 
CPRLSRELPEWEPEDLQDSLDRWYSTP 
FSYPELPDSCQPYGSCFYSLEEEHVGFS 
LDVDE IEKYQEGEEDQKPPCPRLNE VLM 
EAEEPEVLQDSLDRCYSTTSTYFQLHAS 
FQQYRSAFYSFEEQDVSLALDVDNRFFT 
LTVIRHHLAFQMG VI FPH 


8371 


22272 


A 


8432 


2 


119 


DGDNILVTTFI YIKSVTELNGDI ITNAS 
WRCSERVAPSA 


8372 


22273 


A 


8433 


192 


1285 


AGVLS I IEETDSEGLQTKWENQTYDER 
LE INDSEEVAS I YTPTPRHQGLPRSAHL 
PNKAMADNS S DE C E E ENNKE KKKTS Q LT 
PQRGFSENEDDDDDDDDSSETDSDSDDD 
DEEHGAPLEGAYDPADYEHLPVSAE IKE 
LFQVNQLRCVPTDVLDHKLKPFIPDFIP 
AVGDIDAFLKVPRPDGKPDNLGLLVLDE 
PSTKQSDPTVLSLWLTENSKQHNITQHM | 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lrysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KVKSLEDAEKNPKAIDTWIESISELHRS 
KPPATVHYTRPMPDIDTLMQEWSPEFEE 
LLGKVSLPTAEIDCSLAEYIDMICAILD 
IPVYKSRIQSLHLLFSLYSEFKHSQHFK 
AjDAEGKKAFT P S SNS TSQAGDMETLTFS 


8373 


22274 


A 


8434 


3 


251 


TliQADHFNTRIiS CGDAAQTLWARTG YLG 
FVRRTELTAATGERHDALYWGALDETL 
E LRGLR YHP I D I E T S VSR I HRS I AE W 


8374 


22275 


A 


8435 


1 


225 


QTFNLEGSQI YEDS I VLQSVFKSARQKI 
AIEEESEDESNEEEDEDDEEYHEWKRYD 
RLGENMCLNCIWNGGGYIV 


8375 


22276 


A 


8436 


95 


340 


CGCGIAGIjAMSAI FNFQSLLTVILLL I C 
TCAYIRSIAPSLLDRNKTGLLGIFWKCA 
RIGERKSPYVAVCCIVMAFSILFIQ 


8376 


22277 


A 


8437 


1 


317 


GPKPLVRTSREPGKDVTTSGYSSVSTAC 
PTSSVDGGLGALPQPTSVLSLDSDSHTQ 
PCHHQARKSCLQWRPPSPPESTVSQQQV 
KRINLCIHSEEEDMNLGLVRL 


8377 


22278 


A 


8438 


80 


213 


PHLSFNAGITTIKVNIRNANSLGGGFHC 
WTCDARRRGTLQSYLD 


8378 


22279 


A 


8439 


3 


767 


HEDNIKQLKEMKFTYLINYIQDEINTIF 
NDYIPYVFKLLKENLCLNLHKFNEFIQN 
ELQEASQELQQIHQYIMALREEYFDPS I 
VGWTVKYYELEEKIVSLIKNLLVALKDF 
HSEYIVSASNFTSQLSSQVEQFLHRNIQ 
EYLS ILTDPDGKGNEKIAELSATAQEI I 
KSQAI ATKKI I SDYHQQFRYKLQDFSDQ 
LSD Y YE KF I AE S KRL I DLS I QNYHTFL I 
YITELLKKLQSTTVMNPYMKLAPGELTI 
IL 


8379 


22280 


A 


8440 


103 


354 


NGCECDFLFLFLFYFFETESRSVAQAGV 
QWHCLGS PQPS S PRFKRFS CLS TLGGRG 
GWIMMSGVQDQPGQHGETPFLMQAGLKT 


8380 


22281 


A 


8441 


3 


160 


KLYPLKIVFGMNGRVWVKGKTVQQTLIL 
ANVLE ACELMTLDQRI LMF ILAES 


8381 


22282 


A 


8442 


459 


3 


CGGLHP VRAS WLL C L P KQ AWAMAGAP P P 
AWLP PCSLI SDCCASNQQDS VGVGP S E P 
GAGYNLLVRRFLSLSEKRS IRVRVTRFS 
RCHLSPLSLTRKGNSLTPGTSRVRQCLA 
LliRLAHGALHPLSCAHCLALPSEMTPVP 
QMENAE ITRLL 


8382 


22283 


A 


8443 


2 


318 


RQGNMTAALQATLLNPPINTKS QAVTSTDR 
ARGI VLNVL ISFKAND IEKAVQS LDKNG 
VDLLMNYMYKGFES PSDNSNAMLLQWHE 
KALAAEGVTSIVRVSTALIPA 


8383 


22284 


A 


8444 


2 


109 


VTPFPMSCDLQGDCACRNPQAQEHSRKD 
LRGYSHG 


8384 


22285 


A 


8445 


'1 


69 


LVLNSWPQVIRRPWPPKWGLQV 


8385 


22286 


A 


8446 


1 


163 


PSEKHNIWVGVTQFSRCRLSPLSLTRKG 
NSLIP CAS RVRQCLTLLRLVHGACTH 


8386 


22287 


A 


8447 


1 


147 


GQAGLELLMSGDRPTSASQSAG1TGVSH 
RARPS SISFI LELRG S VRKKF 


8387 


222SS 


A 


8448 


914 


729 


GLTMDTQKDVQPPKQQPMIYICGECHTE 
NE I KSRDP I RCRECG YRIM YKKRTKRLV 
VFDAR 


8388 


22289 


A 


8449 


134 


392 


TCPPPPEPPS PLTCAVLVPQTRRWRGLG 
SLFRRACCVALPLQLLLLLFLLLLFLLP 
IREEDRS CTIiANNFARS FTLMLR YNGPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine 7 D=Aspartic Acid, 
E=GJutamic Acid, F=PbenylaJanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine r 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PT 


8389 


22290 


A 


8450 • 


37 


248 


AKI TPLLQPGRQEQDS I SNKKI LRCCLT 
MLPRLASDSSGFKRSSHFSLPTSWDYRH 
SPPPLASVPFLLMC 


8390 


22291 


A 


8451 


367 


165 


NFFVFLVETGFCHAGQARLQLLASCDLP 
VLALQS AG I TGVSHCARP I S TLKDT I KT 
SHFVPLKISGK 


8391 


22292 


A 


8452 


2 


282 


GKRMAGGPEMMQLRLDGNRLY I TTSMFS 
AWDKHLYPDLIREGSAMLHVDVDTALGG 
LKLNPYFLVDFWKEPLGPSLAHSLRYPG 
GDCSSDIWI 


8392 


22293 


A 


8453 


4 


163 


HHD FHAQ SLIANI ERIAI LVFAACE AAA 
VLSLLVSISNTYGLDYLHNLKLLQC 


8393 


22294 


A 


8454 


374 


197 


TPEVKQS SHLGLS KCWD YRHEP PHQARN 
STFDSNHSSIAAEDKLLGYFGLSQWQPG 
YQP 


8394 


22295 


A 


8455 


343 


161 


MGMGP S VP GTG YNLL VCRLLR P lekc S I 
RVGVSQFS KYHLS RLPL PRKGNS PTPCA 
FWVR 


8395 


22296 


A 


8456 


1 


193 


MGFLHVGQAGLKLPTSGDPPTSASQSAG 
I TGMSHRVQ PYYVL S I CQTLHS VC I CL L 

T TT T rprpT TT 

± X-LiIjI I V Lt 


8396 


22297 


A 


8457 


342 


263 


WLRP WPALWEAEAGGSLELRS SRPA 


8397 


22298 


A 


8458 


101 


303 


KAAKYLFQRLHFCFLNLKKDNMLLFNHR 
MENFKELFGPGTVAHTCNPSTLGGRGGW 
ITRSADRDHPG 




2229y 




8459 


3 


140 


YLKHL CM Y VCKYLRQGLLML P RL VLNS W 
AQRILLPWLPKVLGLQV 


8399 


22300 


A 


8460 


327 


248 


FPRLVLNSWPKAILMPWLCKVLGLQA 


8400 


22301 


A 


8461 


327 


79 


SIPFILAFFIYLSFFFKETGSQSVTHAG 
VPGI I IAQCGLQLLGSGDPPDSASLWG 
TTGTYPTPILSVFPNMPSFFILLCMLF 


8401 


22302 


A 


8462 


365 


273 


MGFQHVGQVGLRWPQVICLPWPLKVLGL 
QA 


8402 


22303 


A 


8463 


375 


217 


QAGLKLLTSDDLPASASQSAGITGVSHH 
AQPRHHFFKTRFIYIFFSSPSSVLS 


8403 


22304 


A 


8464 


238 


354 


RGKCF VQAFL YKE C S GGQARWLT P V I PE 
LWEAEVGRWIT 


8404 


22305 


A 


8465 


367 


248 


FQMRIHKWLIDLHS PSEIKQITS IS IEP 
GVE VE VT IAD I 


8405 


22306 


A 


8466 


341 


153 


NMILGWAQWVTP VI PAIGEAHWGGLLYP 
RS S KPAWATQ QNLL F TKKKKN I Y I YI Y I 
YIYERY 


8406 


22307 


A 


8467 


368 


101 


VEMGFRHVGQAGLELLTSSDPPASVSQS 
AGS S T S L P TAD LLT P GWVGVYGL VD KNP 
VTL I AMQRL PAVLEKKS VAR KNS EMKI V 
LKRGQ 


8407 


22308 


A 


8468 


362 


248 


HVGQAGLALLASGDPPTSASQSARIAGV 
SHRARPEAPL 






A 




Jo4 




GILETFSLIFTATMESRNYCVYFKMRNK 
HIISKTS 


8409 


22310 


A 


8470 


386 


300 


LAML P RLVPNS WS QWL S PL P PNMLGLQ 
V 


8410 


22311 


A 


8471 


380 


178 


PKKNKFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFMYIYIF 
AIVYQTLACNNKFLQWERVTRKLRQC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

ocl 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F— Phenylalanine, 
G— Glycine, HHHistidine, I=Isoleucine, 
K— Lysine, Lf=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosme, 
X— Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


8411 


22312 


A 


8472 


3 


154 


FL VDMG JHHVS QAGLKLLTS GDLPAS AS 
QS P I FFFLFFFFEKQ I IjAQAGG 


8412 


22313 


A 


8473 


3 


162 


GLYHVGQAGLKPLTSGDPLASASQSVRI 
TGVSHRTRPKI I FKLI FFLFFLiVA 


8413 


22314 


A 


8474 


266 


107 


LCLFLIEMRFPYWQAGVKLLSSSHLPS 
SASQSDRITGVSHCFWPEVFKRGIF 


8414 


22315 


A 


8475 


361 


169 


RLPGPSNPP X S ASLAAGI TGARHHAQL I 
FVFLVEMGLCQAGKHNFEKYSRVLIYSP 
IVSFTHFI 


8415 


22316 


A 


8476 


1 


154 


KKL I PTLMDHGAGFKS S VKE I TTAWEM 
AREVELEVE PEDGNELLQSRGKT 


8416 


22317 


A 


8477 


3 


193 


S H YQ S L I S S NHGHKHCGRP QGP L PRKT I 
DLCSLDYQLTFPPLLTHDPVKSPSVRNT 
QELSLIP 


8417 


22318 


A 


8478 


408 


103 


TGSRLTLSSRVECSGVI TAHCNLEIiPGS 
SDPLASASQVAETTGMHHHVWLIFLGIF 
L VKMASHY VAQDDLKLLGS TD P PVS AS Q 
SVG I TGI SHHAWAT I CYK 


8418 


22319 


- A 


8479 


59 


166 


KVNTCLCLHTHTHTHTHTHTHTHNYFKE 
TPIHQYI 


8419 


22320 


A 


8480 


394 


253 


FLLLLIjESGFCHVAQAGLKLLSSCDLiPA 
LASQSAEISGVSHCAQPLA 


8420 


22321 


A 


8481 


379 


184 


CFSLPKCWDFRGVPISPGDFWNFIIKLG 
VCP FCQGGFELL APND VAAS G FQNAG I S 
GINFWAWAL 


8421 


22322 


A 


8482 


367 


170 


CGLSSGLGGRSGRESERKKERERERDRE 
KKKEGEGERKRERERESVSSYKDTNSMG 
SGPHPYSLI 


8422 


22323 


A 


8483 


324 


136 


QRSS CLSLPS S WDYRRMPPHLANFCI FC 
RDGVLPCCPSWSQMTTSFLSLNFLICLV 
GIILGRG 


8423 


22324 


A 


8484 


1 


57 


GERRYGTCIYQGRL.WAFCC 


8424 


22325 


A 


8485 


35 


339 


GWLLRFFWWQWKNGNNL3STVS L I DGWLS Y 
DT LG YNG I LN CHM Y ILTEGDSQ KKKKKK 
KKKKKKKKKKKIDPAGNSKIYNRKIFKT 
PRGFFKWALGPLL ILFF 


8425 


22326 


A 


8486 


330 


66 


RPGVS RFtfPWP L I FFLRRVFLCRPGWNA 
WQSLLTKISASQVQAIFLGGGGCSEPR 
S CHCTPAWATTAKLCFKKKNGKNFTD I M 
FSK 


8426 


22327 


A 


8487 


3 


209 


CLSLLRSWDAQAPPPHPANFFLMFNFCR 
VGVLLCCPGLASSDPPPLVSQSARITGM 
SHCTWTIFIiSFK 


8427 


22328 


A 


8488 


3 


216 


GQSITVSCTATNSDVGSSNLVSWYQSSN 
Ij VS W YQQHPGQAP KL 1 1 YEDNKRLS SEK 
KKKKKKRKKKKKKK 


8428 


22329 


A 


8489 


326 


62 


PAYFS IGSFFLFLFFFKTECHSVTRLEC 
SGAISAHCKLRFPGSSNSPASGGGGRSE 
P KS HHF TLAWT TRAKLHLNKNK I KTVKE 
FLSS 






A 




1 


no 


ILGIVCQLRLDRWVCGCKQGQQR 


8430 


22331 


A 


8491 


399 


254 


KNNKRNKTINNKITNKLIIKLKNISFLY 
FFFFFFYFFFFFFFFFFFFF 


8431 


22332 


A 


8492 


2 


130 


FRRVGQAGLELLTSGDLPAS AS QGAG I T 
GVSHRAQPTFLKSC 


8432 


22333 


A 


8493 


3 


94 


HIiPAE F TPAVHAS LDKFLAS VS TVhTS K 
YR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbeny la la nine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
YV=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8433 


22334 


A 


8494 


327 


70 


KSGFSNVGQAGLKI LTSSDLPVS ASQS A 
GITGMSHCTWPLYPYFGLGSRSFHSIEE 
HCWYQLAPFPSCRNPFVIVPFFVHRFYL 
TF 


8434 


22335 


A 


8495 


97 


219 


PGTVAHACNPGVLGGQGGWMTSGQEFQT 
S LASAAKPRSTRP 


8435 


22336 


A 


8496 


3 


191 


GIRRGEILEVIEFTSNEEMLCRDPKGKY 
GYVPRTALLPLETEVYDDVDFCDPLENQ 
PLPLGR 


8436 


22337 


A 


8497 


144 


304 


SYFYE EILFSTVI KKNAIVS FTATWVEL 
EAI ILSELTQEQKTKYHMFSLVSGS 


8437 


22338 


A 


8498 


2 


90 


LTILVLAMWKVGFFKRNRPPLEEDDEEG 
E 


8438 


22339 


A 


8499 


1 


156 


PSS QD YMHAP PHLANFLNKF F VKWCLAM 
liPRLMSNWPQTILLSPPKALGLHV 


8439 


22340 


A 


8500 


67 


319 


KLMS FGPTLLS ARCVS TGQ I GMNELNLN 
I DAMSVAATINNE IMS FART WMKLKAI T 
LIILTLEHKTKYCMCSLISGSLRMTTQG 


8440 


22341 


A 


8501 


3 


299 


ETGSHCVDQGGVQWHISAHCSLDLLGSS 
DPPAS ASRWGI TS AHHP LLWS I SSLPP 
liLLHPHNLLNLLLLLLPSLS I VLNLHIL 
PPLLPPL.RLLCSPC 


8441 


22342 


A 


8502 


56 


142 


NNSAKKKKKKKKKKKKKKKKKKKKKKGG 
AAF 


8442 


22343 


A 


8503 


486 


348 


TGFHRVAQAGLELLSSGNP PALASQNGG 
I TGVSHHAQPAE YNFKK 


8443 


22344 


A 


8504 


158 


403 


RIFFFFETVGFVAQDGVQGRGAILjQPRIi 
FELKRFS CLSLPS S WDYKRAQPNPNDF F 
KRQMAFDPKI QR ISLGSXKKKNHP IE 


8444 


22345 


A 


8505 


3 


78 


DWHIAYVLLYGPRRVEIMEEESEQ 


8445 


22346 


A 


8506 


164 


291 


TRFCCCWLFWWFCVLCVCVAVFPPWCI 
LFFIWLFFFFFFF 


8446 


22347 


A 


8507 


372 


69 


CAPLPPLKFFFPPRGFIFRGGGSPIFPP 
PKKGVLPKNPPKGFFPPPPFLKPAPGGA 
FNYPPFGGKIFFSPPRGKWAPPGGFLKG 
APPFFFFFFFFFFLFF 


8447 


22348 


A 


8508 


282 


1 


SFQHVAQAGLKLLGSRDPYTLAAQSAGI 
TGVS HHAR P VL YFS S VL FSME FL S TMS A 
VILPCICSFCVSLAVPSSCILQQPLPPT 
CTYIKTETCI 


8448 


22349 


A 


8509 


400 


45 


TPPVSSLFFLPLKEIIFFCPPFRQSLPP 
PFFFFPPPLFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFKYLSLPFTL 
SKEKNK 


8449 


22350 


A 


8510 


241 


419 


TPLGGWEKKKKKKKKKKKKKKKKKKKKK 
KKKKNKKKKKKGGGFLKKPLGGANFIGA 
GNN 


8450 


22351 


A 


8511 


317 


28 


Q S IjS P S PQTV YF I YL 1 1 YAVFFT TLT 1 1 
LIFSPS FRMELKFLLRCYFFAP FYAFL F 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFSQPGAQTTAGRAGV 


8451 


22352 


A 


8512 


124 


292 


MCCYIMSLLKYICTIIKEFSTKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKIKK 
KGGVPL 


8452 


22353 


A 


8513 


1247 


910 


QAQLGD I GT S C YT KS GM I LCRND Y I RL F 
GNSGAAGACGQS I PASELVMRAQGNVYH 
LKCFTCSTCRNRLVPGDRFHYINGSLFC 



1067 



WO 01/64835 



PCT7US0 1/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=Leueine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














EHDRPTALINGHBNSLQSNPLLPDQKVC 


8453 


22354 


A 


8514 


99 


262 


MNKRFLLiNKKKKKKKKKKKKKKKKKKGG 
PFKKNPGGAQ I FPGGEKKI FFLKGGB 


8454 


22355 


A 


8515 


85 


2 


GWGWAQWLTP VI PVLWEAEAGGSRGQE 


8455 


22356 


A 


8516 


244 


423 


VFAVFNFLGWVMDMLINLAWI ILQCAC 
ISNHHVVHLEYIQFBFANYTS IKLGKKP 
SKSV 


8456 


22357 


A 


8517 


3 


114 


KINGEDKQKILDKCNEI INWLDKNQVCV 
FFFFFFPI 


8457 


22358 


A 


8518 


413 


91 


GGVRGK I S FRGGG FLKKKKKKLLKKKKK 
WGGGPKEKKKRGGFFFFFBLNFFFWGGV 
KKKKKKKKKKKKKKKKKKKKKKKKKRRK 
GRKEGREGRRKEGEGRQE INMTA 


8458 


22359 


A 


8519 


1 


163 


SRLPGRSRRRSRGRRRRRRRRRRRRRRR 
SRVL I FPWDQNBDLNTVYI IEDVDAA 


8459 


22360 


A 


8520 


191 


21 


RFTTHVDAWATVTVFLIESVFLHVGQAC 
BELL P S DDL PAS AS ENAR VTGVS HRAWL 


8460 


22361 


A 


8521 


2 


144 


AAATSQAWWCMPWPATWEAEVGASLKP 
GRLQLPBTMIABVHLSYAI 


8461 


22362 


A 


8522 


90 


8 


KPTPFBPTLIARTTLLLP I SPLILI IB 


8462 


22363 


A 


8523 


305 


146 


VFRIHSCGIRGSVDVKITDGLBVIRRIE 
3STVPPGPNNKNKNPYAIFQSSS I ESQ 


8463 


22364 


A 


8524 


176 


2 


LAWPFLFLPKCWDYRHEPLQPASLTTFV 
RE WDK I GWAW WLT P VI PAL WE AEVGRS Q 
GQ 


8464 


22365 


A 


8525 


2 


200 


CLIPS SWDYRHMPPRPANFCIFFFFLGD 
HVSQVGLKLLTS KDPLAWTSQRAGI TGF 
SHHTKPQGFF 


8465 


22366 


A 


8526 


2 


217 


SFSLLFPPSFSLLLPPLASLLLPPLPSF 
SLLFPPSFSLLSPPSFSLLBPPSFSIBL 
HPTFSHLYPPSPSFI 


8466 


22367 


A 


8527 


2 


186 


I RGLRP LE KRG I R VGVSR F SRYHLSRLS 
F ARKGNS P T P CAS R VRRC P S L LHGLHP L 
SDKPQ 


8467 


22368 


A 


8528 


3 


153 


CBGFLSGWDCRCIPPCLANLKKLFVEMG 
SLFWPQWILPPWPLKVBGLQV 


8468 


22369 


A 


8529 


2 


388 


RTAVHT F I L VL Q VC VCRWHB S LAALT PA 
HLVCSSHRILSYLSIRGSWDHRCMTTTP 
GLVIS IFFVEMKYRYAVHAGHIBLGSSY 
P AS A YQ S AE I TGVS HWAWP TD S BWQKPH 
MVI WCI S CFBNVL IGTM 


8469 


22370 


A 


8530 


293 


64 


YRISIKVSKWFKGWVSSFWIFFFFVFFF 
S FFFVFFCFFMFFBKQYLKF I INI 1 1 IB 
TQALYI CTTSKI CYSFLYFR 


8470 


22371 


A 


8531 


2 


250 


S CLGL P S S WD YRGT L P CLANF C I FMRD G 
VSLCCQGWSQTPGSQVIDEAVYS I YAYY 
I BALL I VYVAQWE VWLKHFCC I LKYT 


8471 


22372 


A 


8532 


2 


201 


FHHVGQTGLELBS S GDLP AS AS Q CAG I T 
GVSHCT WPS IBYRMNFKNBSAVFIAWVI 
itjGJM x T. JM is. 1 L- 1 


8472 


22373 


A 


8533 


319 


200 


T F T F F BQRWGL TMBPRL VSNS WAQAI C B 
BQPPKVBELWA 


8473 


22374 


A 


8534 


160 


297 


KTETLABVAHACNP STLGG I GRQVS S AH 
E FKT I LGNMTKP I L YTKB 


8474 


22375 


A 


8535 


333 


228 


PNRGH Y I AI VKSHD FWLLFDDD I VE VS S 
FLISYF 


8475 


22376 


A 


8536 - 


3 


325 


PPSSLLPCSWILDCCASNERDSVGVGPS 
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peptide 
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M 
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NO: in 
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09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T— Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X^Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EPGAGYNLLVCCFLSLLEKRS IRVGVTR 
FSRCRLSPLSLTRKGNSLTPCASQVRRC 
LALLRLAHSALHPLSCSYCLALP 


8476 


22377 


A 


8537 


216 


5 


KLGFLVNLVEFQVKFFFLFFFLEIILIE 
MRSHYVAQAGLELLGSGDLPASASQSAE 
ITGMSHHALPNYS F 


8477 


22378 


A 


8538 


136 


22 


RPRRRLVLNPWPQAILLPWPPKVLGSQV 
WAT CGRRPAS 


8478 


22379 


A 


8539 


136 


22 


RPRRRLVLNPWPQAILLPWPPKVLGSQV 
WAT CGRRE AS 


8479 


22380 


A 


8540 


8 


319 


YIYMSMYMLIYVCIYVSIYICVCIYINS 
ISFNTYMI IYMLI YVYVYMCVYICVCVY 
IYTNSISFNTHITPIKHSDVHTVSTTRC 
NLGGRGFCHTWPLPHLAGW 


8480 


22381 


A 


8541 


331 


156 


SVNGLLSLPACLPVCLPACLPSFLPSFL 
PSFLSFFLLLLLLLQKPVNSLIFSQSLS 
IS 


8481 


22382 


A 


8542 


72 


1 


KIGQARWLAPVIPALWEAEAGGSP 


8482 


22383 


A 


8543 


2 


83 


QAGVQWRDLGSLQALPPGFMPFSYLGG 


8483 


22384 


A 


8544 


293 


2 


KTWHYLVNAPEVEIQAIYSQETCKSMY 
QETCTR IFIAILFI I APNWKQRQDTMNP 
GNKENGKVATLDS DGFLVRFQS LMRSNC 
PLYPFNKFPILLK 


8484 


22385 


A 


8545 


314 


122 


LVETGFLHVGRAGLELPTWSDPPTSASQ 
GAGIAAVSHRGQPVDFKNNISTQIQGRP 
IICNYKTF 


8485 


22386 


A 


8546 


316 


123 


L VKTG FliNVGRAGLi E I P TWS D P PTWASQ 
GAGIAWSHRGQPVDFKNSISTQIQGRP 
IICNYKTF 


8486 


22387 


A 


8547 


3 


247 


GIMATERLANYTGGIYAEYQDTTYINHV 
VSVAGWGISDGTEYWIVRNSWGEPWVRC 
FHFLKVI PRKKTS W YS SRLNGS VYV 


8487 


22388 


A 


8548 


289 


25 


QRREKFFKNGPPIFFRGPFGFPQIFVSF 
P FFFP KI GVFF VFFFRFS LVLKGDFFSN 
FLKINCFFFFFFFFFFFFFFFFFFFIKN 
RPCL 


8488 


22389 


A 


8549 


289 


2 


KKKIGETLCKLKLCKKFLDVTPKAQFIK 
Q KKE KIi I N WQ F I K I KNFRS P KALLRRG K 
GKPGTGRKRNANH I S DKGPLCR I QKKNS 
RGRAQWLTLW 


8489 


22390 


A 


8550 


290 


187 


RQENPLNLGGGGCSEPRWCHCTAAWVTE 
QDSINK 


8490 


22391 


A 


8551 


293 


183 


RDEGRG I S YML P RL VLNS W P QA I LLP Q P 
PKVLELQA 


8491 


22392 


A 


8552 


364 


87 


MLiPLDFFLGPSLDFCPPFCESGSGGIKV 
PESTTPRASAFLPPGAANLHHILQLVGA 
PRVPPGFHHVGQDGLDLVDLVIRLPRPP 
KVLGLQGG 


8492 


22393 


A 


8553 


57 


225 


KWQKWTPWAGGVAHACNLSTLGGRGGKI 
ACAEEFKTIIiENIVRPCLSKNKWISFS 


8493 


22394 


A 


8554 


326 


2 


KS KRGYSEPLVFS I CSARAPRSHSVSQA 
TVYGHAHASTQPFPPRLNRSSYLSLLSS 
GDYRHDPPPIiAYICISVIGNBVEHLFLC 
LLSTC I S S VNRSHVYS TLVP I FGP 


8494 


22395 


A 


8555 


372 


148 


YI CFLRSS VRQ I LFTPDCSDCSMWVPCN 
IRLPGS THWCASASQGAGTTGARHHARL 
I FFWVFFCI FS I FSRENH 


8495 


22396 


A 


8556 


422 


269 


AADQERLHTYQLNYYHFCKRWGLTTFPR 
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peptide 
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Predict- 
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nucle- 
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correspon 
ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, '—Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LVSSSWVQAILPTWPHKVLRLQA 


8496 


22397 


A 


8557 


669 


407 


AGVKWRNLSTLQPPPRRFKRFSHLSLPS 
SWDYRCPLPCPANYCIFLVEKEFRHVGK 
AGLE LLiT S GD P P AW VS L I AG I TGVSHRA 
HPD 


8497 


22398 


A 


8558 


195 


352 


LGLQAGTTAS GYS TVI LGGMLKPRSWAW 
WPWPATREAEVGGSLEPGRSKLQ 


8498 


22399 


A 


8559 


345 


219 


MLARM I S I SLPCDP PAS AS QS AGITGMS 
HHGWLNLiFKS IHL 


8499 


22400 


A 


8560 


137 


2 


VKYCLPTRSLIKTVRPLSPSQATASPVT 
CT YT P RW P E VT E E S Q KN 


8500 


22401 


A 


8561 


324 


193 


RSSGNLPASAAQSARITGVSHRTRPKCI 
I SKEYVAENNHLKS TS 


8501 


22402 


A 


8562 


326 


121 


HNHTHTHTKNHKKHQQPQHTHTYHTQTD 
IYVLQAASQKSLILISSLAV 


8502 


22403 


A 


8563 


3 


85 


HEHTHTHTHGHTHTHRHTHTHTLCARM 


8503 


22404 


A 


8564 


2 


153 


ARDRHI PPCLTNFS VFWRDGGLAMLPRL 
VS KLWAQAVHP P WLLKVLRLQA 


8504 


22405 


A 


8565 


2 


177 


ARVGFHYVGQAGLGRLTS GDP PAS AS Q S 
AG I TGMSHRARPMS S DTLHI LQDTWRAY 
AG 


8505 


22406 


A 


8566 


353 


212 


AGFRHVGRAGLELVTQDGPRTSACQSSG 
ITGMSHCAHPEKSTAFILF 


8506 


22407 


A 


8567 


76 


191 


ELIFCRDGGLTVFPKLISNTWPYAVLPL 
WP P KVLGLQV 


8507 


22408 


A 


8568 


1 


142 


GTRGFLHVGQAGLELPTSGDPPALASQG 
AQMTGVSHCASQKKEFLEW 


8508 


22409 


A 


8569 


250 


95 


SASRAPLPIiPPSHIiPLRAAGLSPLCPPR 
LVSSYRPHVILLPLPPKVLGLQV 


8509 


22410 


A 


8570 


3 


226 


HEHTHTHTHTHTHTHTHGHTHTHTHTLV 
LVYSLCEHIWRFIVISRARVCCIYWVF 
FECGYADRRVAD KWLWIH 


8510 


22411 


A 


8571 


1 


216 


GTRGLGVGVSHL VS KNTHTHTHTHTHTH 
THTHTATASQLSVFKIWAFQRSLDKGLT 
YISNSYCSSKFYLYNH 


851 1 


22412 


A 


8572 


356 


268 


LTMIARLVLNFRPQVIPQPWPPKMLGLQ 
A 


8512 


22413 


A 


8573 


12 


175 


RWGFHPVGQAGLELLTSSDLPTLASQKA 
E ITGMSHCARPS FFS FFLS FFFWKHL 


8513 


22414 


A 


8574 


1 


163 


GTRTG FHHVGQADLE LLTS GD P P VS AS Q 
SVG I TGVS HRAW P VGL C 1 1 AL I S NE Y 


8514 


22415 


A 


8575 


344 


176 


EFLVETGFRHIGQAGFELLTLGDQSTSA 
S QNTE ITGVSHPAPGLAI I FKSGNQAFF 


8515 


22416 


A 


8576 


2 


179 


ARG I L CHS GTFKLD S RP S M VAHTCMP S T 
LGSQGGRIACGQE 1DTSLLKVARPRLYG 
QIF 


8516 


22417 


A 


8577 


1 


100 


GTRG FTML PR WS D S WAQV I LL P W P PRV 
LELQM 


8517 


22418 


A 


8578 


. 3 


114 


HERHE ILS I YLTI YLS I YLS IYLSIYL I 
YIiPTITRI 


8518 


22419 


A 


8579 


1 


264 


GTRPCRIELYRWESIiAKAQETSGEEIS 
KFYLPNCNKNGFYHSRQCETSMDGEAGL 
CWRVYPWNGKRIPGSPEITGDPNCHIYF 
NVQN 


8519 


22420 


A 


8580' 


1 


245 


GTSGTSGILGFYLEREDRLLQIRTDTIT 
LSHGYSTFSLWRCDSRHHMNRMFQMLYQ 
S P FLGY I QLSHE VTiAFGI S Y I YNTE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
0=Cysteine, D=Aspartic Acid, 
E— Glutamic Acid, F— Phenylalanine, 
G-Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T—Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


8520 


22421 


A 


8581 


1531 


1457 


QARBTETIARRLWDVSCDLLGLPID 


8521 


22422 


A 


8582 


191 


2 


TITSYFSPTPAPGSHYATFCFYKFDYSR 
YFIKVELWEAETGGSRGQEIETILANTV 
KPRLSRA 


8522 


22423 


A 


8584 


2 


212 


ARAGIYPKEYKSFYYKDTCMRMFIAGLF 
TIPKTWTQSKCPSWIDWIKKMYRQGPIL 
SPMLECSGEVSGFK 


8523 


22424 


A 


8585 


2 


192 


ARGNS PASS S RVAG I TGAHHHARL X FVF 
FLFVRFFFS FLKTGFKLVAHSNFQPLTL 
FAPLHFK 


8524 


22425 


A 


8586 


277 


66 


PLD F RLGHKRRLP F PKKKKKE S KFVCVC 
VCVCVCVCFKRQCLVTLPRLVSNSWAQA 
ILLSWPPKILALQV 


8525 


22426 


A 


8587 


395 


286 


PHLGL P KC WD YRRE P P CLAE WLKK YVW 
VTYFSFRR 


8526 


22427 


A 


8588 


307 


177 


RYPSTINYS ILNRDKIMPKLDRMVYKAR 
PRVMCLPWPPKVLGL 


8527 


22428 


A 


8589 


167 


349 


RNLRELGEFFFFFFWFFFFFFFFVGKFS 
VFLLVGNLFLFFGGGIFGLSRVMCFKEF 
RCFS 


8528 


22429 


A 


8590 


375 


240 


LLSRFSWFCFVFRQGLTMLPRLVLNCWA 
RG I IiL P WS P KYLGLQA 


8529 


22430 


A 


8591 


347 


215 


ETGF PHVGQAGLKLiIaAS S D P P AS AS QS A 
QITGMSRRAQPSAKRF 


8530 


22431 


A 


8592 


340 


114 


HRTAHCSLYLSGSSDPATSSSQVTGTTG 
KDQHTWLLFKYFCRARVSKINKYKS INK 
FYFLNTQTTPSEYGKASPG 


8531 


22432 


A 


8593 


86 


253 


LRLCCGLSCSTAKKKKKKKKKKPKKKKI 
KKKWLHPKRGGFEEIDFFFPKGGKNLF 


8532 


22433 


A 


8594 


3 


135 


GF CHVGQVGLELLTSGDP PALAS QNAE I 
TGVSHRAWPPVLFFF 


8533 


22434 


A 


8595 


2 


274 


PRVRT I S LLG I L VYRSHL I S S KEjCLEG I 
ILSLFIIATLITLNTHSLIiANIVPIAIL 
VFAACE AAVGIiALLVS I SNT YGLDYVHN 
LNIiLQC 


8534 


22435 


A 


8596 


3 


316 


SIPYSWGEKEGIPCMAPPQIQGTSRLND 
FTALSLHLNTHTHTHTHTHTHE IQHLPP 
QGITALILNSLLRHYCPSIiAFPSLSPKQ 
NLT VRHC I L S RDT WLATAS K 


8535 


22436 


A 


8597 


385 


291 


KAPPLFFFFFFFFFFFFFFFFFFFFKPV 
KKILGV 


8536 


22437 


A 


8598 


275 


369 


GVQVLKLLTSGDLPALASQS AE I TGVSH 
RAR 


8537 


22438 


A 


8599 


98 


344 


VHYEE FMCVCVCVCVCVCVCWPRVRGVD 
MNP VEH P FGGGTHQH I GKP CT I RRNAS A 
GRKVRLMAAL WT VR I GGTKT VTL RKT 


8538 


22439 


A 


8600 


328 


195 


THVPPSLAKFVFLVKMGFLHIGQAGLDL 
STSGDPPALASHVFKR 


8539 


22440 


A 


8601 


2 


214 


HVDWMGT FA I L S E LHWDMLHVD P ENLR 
LtLiD S VLNCEIjANHFGTDFT p p vq aayq k 
WADVANALAHMYH 


8540 


22441 


A 


8602 


89 


181 


GLRHENRLNSGGGGCSEPKLCHCTPAWA 
TE 


8541 


22442 


A 


8603 


340 


57 


KEGHQEMEQAEDGGHGRKAPGTS PWTLG 
ELPKSARGHTGLFVSYRGCRTQFRFKKV 
LFGGDRNRGIAMLPRLVLNSWPQEILLP 
QPSKVQGLQK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalan ine, 
G=Glycine, H=Histidine, I=Iso/eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine 7 P=0Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


8542 


22443 


A 


8604 


330 


132 


TGFLHVGQSGLKLLTSGDLPTSAFQNAG 
ITGGSHCTRPFSLFHSYLVLFYGCSSSQ 
VSLSIIoARM 


8543 


22444 


A 


8605 


546 


279 


RPGPTVSPRLECICMIMAHCSLDFPGSG 
VS PTSASRVARTTSTHHHAWLIFYSSVB 
TGSHHVAQAGLELLGS SNPPI SASQS AG 
ITAL 


8544 


22445 


A 


8606 


3 


249 


NR FJNTLGS GG CS E PR SHHC T PS WATE QE T 
CITYLEYCFPPVSHTNLVYFKPLNVTIR 
SD YC I LGLHAS S I GGSD S L I I D CWD 


8545 


22446 


A 


8607 


159 


324 


VIQYRTNILHLHLCEVLVFWFLPYFFLK 
KTYTGIGQMRWLTPVIPALWEAEAGGS 


8546 


22447 


A 


8608 


334 


179 


TGFPHVGQSGLKLLTSGDLPTSGSQSAG 
ITGVSHCTWLEVI YFLNIS IYPS 


8547 


22448 


A 


8609 


278 


202 


QQPSTLRQDPPPAKRLHLSEGSDGH 


8548 


22449 


A 


8610 


236 


66 


AASTFFLQKIKRHLLSGTVAHACNPSTL 
GGRG I RI SRGRE FKTSLGNMVRPRVYKK 


8549 


22450 


A 


8611 


29 


177 


GKQVMALHCFIFFHFFFRRSIiAMLPRLV 
SNSWVQEILPPWPLKVLELQA 


8550 


22451 


A 


8612 


1 


215 


HTSRTLFVHLRLKNFNKYL IEWGLSML.P 
RLVLNS WAQAVLP PWP S KVLG I TGMSHC 
TWLYLHLSTDLLKLC 


8551 


22452 


A 


8613 


3 


91 


LRHGNCLDPGEGGCSEPRSCHCTPAWVT 
E 


8552 


22453 


A 


8614 


2 


167 


FTISLLGILVDRSLTHHINNIKPSFTRE 
NTLMFIHLSPILSTRKQLAIQHLSRLE 


8553 


22454 


A 


8615 


2 


281 


FCILVETGFHCVAHAGLQLLSSGNPPAS 
ASQSVGITGMSHCTSPNLSLLTQSKSFC 
LLIQNHNPYAFINSQLTFSILFLLSYFI 
FYDSLFLLF 


8554 


22455 


A 


8616 


282 


182 


GQAWWLTPVI PALWEAEVGGLLEPRS SR 
PAWST 


8555 


22456 


A 


8617 


379 


207 


S NQ T TKK YINFF F VEMR S H YVAGAGL KF 
LGS SNP PTSASQS AS ITGI SHYVWP ISP 
S 


8556 


22457 


A 


8618 


705 


483 


DKS FAL VARAGVQ WHHLGS LQPLiPPGFK 
QFSYLSLPSSWDYRHAPPRPANFCILSR 
DGVSPCWPGWSRTPDLR 


8557 


22458 


A 


8619 


359 


220 


RQSLTMLPRLVSNSWAQAILPPWLiSRLL 
GLQAiiAP I PGKS YTSTEKTP 


8558 


22459 


A 


8620 


3 


186 


HEVSWVKRKQDEWIEFDDDTVS IVAPED 
ILRLSAGGDCHIAYVLLYGPRRVQILEE 
ESEQ 


8559 


22460 


A 


8621 


1 


116 


GTSFCRDGGLTVLAQLGSSFWPQRILPP 
WPPKWEL.QA 


8560 


22461 


A 


8622 


369 


188 


MYSLEMGFIGNGHAGLELLASGDLSALA 
SESAG I TCMSLRYRTRFS FKRAT CGLDL 
SLQS 


8561 


22462 


A 


8623 


202 


22 


YILGFYFMPVICYQERKKDKKNWEVTTI 
MS FVG I CMELGAI X LS KLMQ EQKTKYH I 
FSQV 


8562 


22463 


A 


8624 


387 


315 


SNLGGGGCSELRL CHC TPAWATE 


8563 


22464 


A 


8625 


2 


135 


TRDLCSLVYLLTFPPLLSHDPAKSPSVR 
NTQE L S I KKKKKGR P F 


8564 


22465 


A 


8626 


3 


232 


HASAFE E P VY I KS RQ KRKE SNP P KLVS S 
QPHGLKKKKKKKKKKKKKRGRGLKKKKK 
KHNKKFKKGQKKYNFGRGAF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Afanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=PhenyIa]anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W— Tryptophan, Y=Tyrosine, I 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


8565 


22466 


A 


8627 


97 


373 


QHGFL KKKKKKKKKKKKKGGP LLKKP LG 
GPNFTGDGKKKFFPKKGG I KKP PGEFLK 
KNLFLGGKKNGKNPPKKLNPLGEKKNFK 
GKKGKNPP 


8566 


22467 


A 


8628 


2 


188 


RYGCPP P YHTFEE P VYI KS RQKRKEWNP 
PKLVS S QPHGLHD FFKKKKKKGAAVLKD 
PSGGPS 


8567 


22468 


A 


8629 


287 


68 


GEEGRTQKGERNGGGGSERKEGEEHEAR 
RTGRGEGGGGEAKGRPRSARRRGRGSRG 
SKSQTPSRHLPAHI ITN 


8568 


22469 


A 


8630 


364 


136 


VSLLLGLEYSGTITAHYSLNFLLFIVET 
GSHYVAQTGLKLLGS SNP PTSDSQTFE I 
TGVSHGAQPKVQFWSKNLDT 


8569 


22470 


A 


8631 


355 


70 


LFPPRFFPFFSPLSPLKFFFSPKGFNFF 
REFFPIFS PPKKRVLSKNS PGGF YKP PL 
RGKIFTFPPPVKFGPPRGLFKGPPPFFF 
FFLFFFFFLVI 


8570 


22471 


A 


8632 


2 


266 


AFTISLLGILVYRSHLISSLLCLEGIIL 
SLVI IATVITLNTHSLLANIVPIAILVF 
AACEAAVGLALLGS I SNT YGLDYGHNLN 
LLQC 


8571 


22472 


A 


8633 


1 


186 


LTHHINNIKPSFTRENTLMFIHLiSPILL 
LKKKKKKKKKKKKKKKKKKKKGGALKKK 
PGGGQN 


8572 


22473 


A 


8634 


2 


80 


VLiE I AVAL I QAYVFTLLVS L YLRDNT 


8573 


22474 


A 


8635 


713 


333 


EGPPPPRSKKKGTQGGGKDPPSPPEPKT 
PGNPQGRKTGPPPPGRPYTGPRPPGSGP 
TRGGGSRS SSSNTNAPGEKI FFS KNPGR 
KIFPPRAILVFFSPFPLKNFFFSLRLLI 
FLGGCAPFFPPPK 


8574 


22475 


A 


8636 


389 


1 


FIFSFLFLIIFFFLNINFFFYIFSSLSF 
FFFLNFNLLLKHTPPI ILFFFSLSPIFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFGTKKELY 
LLRARGCKQYKNQKLIWYLTF 


8575 


22476 


A 


8637 


2 


192 


VRTLGLRTDAIPGRLYQTTFTATRPGVY 
YGQWYE I CGANHS FMP I VLELLALTTFE 
IGPEFTL 


8576 


22477 


A 


8638 


1 


87 


PTRPLE IAVAL I QAYVFTLLVS VYLHDN 
T 


8577 


22478 


A 


8639 


2 


132 


INLASTLIIFTILILLTILE IAVALIQA 
YGVTLL VS L YLHDNT 


8578 


22479 


A 


8640 


2 


113 


LFFIAEYTNI I I INTLTTTI FLGTTYDA 
LSPELYTTY 


8579 


22480 


A 


8641 


375 


22 


FIFISPLAYLFFLPFPFFFIPPLCFFFF 
IPLFFLFLSFLFSLIFFLFSLILIFPPS 
LFFFFPPPFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
F FFC WLPHR VMIAKVG I SWS KK I 




22481 


A 


8642 


180 


34 


W F I S D S LWLGMVAHT CN P S TLVGQGGW I 
TCGRSRPSVPKRTPSQTCAPA 


8581 


22482 


A 


8643 


338 


133 


NLPLGL F YSLQKKKKKPFLRPGTLGAPK 
GNFQGAPPLFGKFAFFFFFFFFPFFFFF 
FFFYGFFFFLFK 


8582 


22483 


A 


8644 


156 


287 


LTYPIKPPLLCSQKKKKKKKKKKKKKKK 
KKKKKGGAFKKNPWGAQI 


8583 


22484 


A 


8645 


1 


325 


LHHCHTHNF VNKWCF IRKKKKKGKKKK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 

USSN 
09/51 5,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K~Lysine, L=Leucine, M==Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S—Serine, 
T=Threonine, V~VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














ts.lss5.ilt £Lls.iS.iS\Sxss.iS. Jxis.i^iSJ^^^t'ljS^jS^lrif^isi^^ 
FSGGGKKKIFPQRGGYKKPPGGFLGKTL 
FLGGEKMGENPPKKIKPLGEKKIF 


8584 


22485 


A 


8646 


370 


3 


PVRASRLLCLPKQAWAMAGAPPPASLPP 
CSL I SDCCASNQRDS VGAGPSE PGAGYN 
L VMHC FLj S PS E KHS I WVG VTR F S RC C P S 
PL S LTRKGNS LT P CAS Q VRQ G LALLRLV 


8585 


22486 


A 


8647 


341 


88 


KKKKIFSPPIKRGPPSVYFICPPPPFFF 

FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFRMGCDRWHGELYNKI 

ir J. V\ il XS.iJ vjJL'Jj 


8586 


22487 


A 


8648 


1 


108 


PTRPRTRGVAS VLYFTT I L I L I PT I SL I 
ENKILKWA 


8587 


22488 


A 


8649 


3 


96 


EICGANHSFMPIVLELIPLKIFEIGPVF 
TL 


8588 


22489 


A 


8650 


457 


297 


SPSFFFFFFFFFFFFFFFFFFFFFFFFF 

TP XT' T~i IT* IT 1 TP TT 1 IP XT' TT* XT 1 TP TP TT" TP TT 1 TT 1 TIT 1 TP TP T TP"VTP 7> T T7» 

£ r J? r r r r J? r J? r r r r i? r r £ r r rJur jl r Alii: 
FIYLIIKDNYMFQC 


8589 


22490 


A 


8651 


84 


352 


YEKTDVKI I S I LFNTD YMLE YNVLH I LG 

QIKMLLISKKKKKKKKKKKKKKKKKINK 

tPcrAT/^r*T tt opt T r*T\TT vcnr c 7\ t vt c*na ~c 
c v V\J<j(jJj±Jj11j1ajJL V JjxoPijJiAJjKlSPor 

GFLEK 


Of QA 


224y[ 


7\ 
A 


oDjZ 


2 


111 


"\TNTT TTT3 Q U"'PTJ TPATTT Tv/T"C? TT-TT CDTT T T QT TVTTi 

JNJNXiViVoI? IKEilN ixjIYij? InjJjoP J.ljljJjbljJNP 
DIITGFSS 


Qjy 1 






OO.JO 


i 
i 




KyWUUvUUijKlNJLiKlJl? J_)i-iV XlNKLYll iliLi^ry 

RCVP S LHHRALDAEEEACLHS CAGKL IH 
SNHRLMAAY VQLMPALVQRR I AD YEAAS 


8592 


22493 


A 


8654 


129 


268 


LSEVLYLFPPKKKRLIILCLFFRNPGSS 
NLQKITKEPIIDYFDVQD 


8593 


22494 


A 


8655 


281 


117 


CI X TL S F INTLQAY 1 1 WLWF S E VCL YV 

QT GTVT CTVT Q T"V17'T?T?VDVT I/TC "CUT 
olio JL x ±jo X x JLjcj X jl V Jf xKx X rv.J.o.t 1 hLXj 


8594 


22495 


A 


8656 


3 


177 


DLHAHKLGVDPGNLNLLSHCLLETLAGH 
R 






Jrx 


OOJ / 


o 
3 


Z7U 


TGTRHLAWLIWGGFLFFVLRDVLVRFH 
ATDKD I PKTGQFTKERGLMDLQFHMAGK 
ASQS 


. 8596 


22497 


A 


8658 


107 


11 


QFEHTKPTPFLPTL IALTTLLLP IS PFI 

T T TT 


8597 


22498 


A 


8659 


21 


136 


VPIEHDVPVPNSRAELLKMFGIDRDAIA 
QAVRGLITKA 


8598 


22499 


A 


8660 


1 


395 


LVTAL YS L Y I FT T T QWGS LTHH I SNLE K 

WELLFFFLKGGIKKTILGFLGKTLFFGG 
GLLGAPLPRKIKGLREKKNFKGELGVKN 

"D^ riT 1^1 "NT T7Q QT ,CZ\7~VT TTtTV 

KVrrrr \jiW v dojj^jv I ijivtvx 


8599 


72^00 


A 


8661 


1 




RTRGRTRGRTRGLTRGKKKKKKIMKECK'K' 
KKKKKKKKGGAFLKWPWGGP I FWGLPKF 
YFLPNKGSFFNLIGDFLKRPFFWGGAYF 
G 


8600 


22501 


A 


8662 


136 


358 


FHIVKGVS I I KFKC VI LKKKKKKKKKKK 
KKKKKKKKKKGKKLKKKKGGKKKKGGGK 
NKIFFWVGFFYKRVWGYF 


8601 


22502 


A 


8663 


198 


368 


L INLVQFTC I LGTLLG I LF I YLFLRRAI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y-Tyrosine, 
XHLFnknown, ' =Stop codon, /=possibJe 
nucleotide deletion, \=possible 
nucleotide insertion 














TMLARLVLNSWPQAVLLPQPPKVLGLQA 


8602 


22503 


A 


8664 


407 


82 


KFLPHLGFPPFKRFFIKKEPFQWFFFLP 
GFFFFRRSFFLLPRVQCNGAISGPCNLC 
LNVKVRLSFKKKKKKCWDYTFEPLCPAL 
HFS F INKTEDCLF VCFTVYWPHGF 


8603 


22504 


A 


8665 


166 


293 


KI I AKHN I AR TNP YTF C I MKKKKKKKKK 
KKKKKKKKKKKKKKGGAL 


8604 


22505 


A 


8666 


1 


164 


RAKAKRRNTTVS CRMRHLKI VYRRFRYR 
LYVGI IIGPGGLNETLVFTCKKITVP 


8605 


22506 


A 


8667 


295 


85 


YIYMSHICTLSLSLSIYKRVKCFCFFKT 
WSHS VAQAGVQWRDLGS LQP P P PKW I LC 
ISVFFTVLFVMKKN 


8606 


22507 


A 


8668 


2 


131 


INLPSTLIIFTILILLTILEIAVALIQA 
YVFTLLVSLYLHDNT 


8607 


22508 


A 


8669 


423 


107 


HCALPKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKK 


8608 


22509 


A 


8670 


432 


301 


SQILGRFSQKQHLSLGGGGCSHPRSCHC 
TPAWARGRLRLKKF I K 


8609 


22510 


A 


8671 


1 


106 


LKLLTS SDLPAS AS QVAE I TGMSHCARP 
LSRIYMC 


8610 


22511 


A 


8672 


383 


252 


CPANFVFSVEKGFLHVGQAGLELPTSGD 
PPGCWFCFLQQQLHW 


8611 


22512 


A 


8673 


560 


135 


SSWGMDSIPTSSNMEETQQKSNLELLHI 
SLLLIESRLEPTOFLRSTFTNNLVYDTS 
DSDEYHLLKDLEEGIQMLMGRVEDGSHL 
TGQTLKQTYSKFDTNSHNHDALLKNYGL 
L P C FRKDMDKVE T F LRMVQ FR S VEGS CG 
F 


8612 


22513 


A 


8674 


2 


326 


SLLMCVHRCECVCMRACLCAGVCMCIAS 
CLGL PMNWE C YT WRVLVFHQ F QDEELH 
DTVDLET I PLERQ PRDVQHPVS TRIL YL 
HVYF VAVTLTL I R I LQLWTEAFS P 


8613 


22514 


A 


8675 


171 


272 


TQC CLLKAE KKKP KKKP KKKKKKKKKKK 
KGGPF 


8614 


22515 


A 


8676 


60 


409 


LVWLESHEDVGPLIKDSKQEKKKKKKKK 
KKKKKKKKKKKRGGAFKKKPGGGQNLRG 
GEKKNFFFLRGEKKNPRGNFGKKTFFWG 
GENWAKPPPKKKS PEGKKKILKGKGGKK 
YLNFLGE 


8615 


22516 


A 


8677 


106 


286 


LFWNKEGNNWKKKKKKKKKKKKKKKKKK 
KKKKKKKWGGALLKI SGGGP I FPGGGKN 
FFFFFGGGF 


8616 


22517 


A 


8678 


1 


97 


IIGQVASVLYFTTIIilLIPTISLIENKI 
LKWA 


8617 


22518 


A 


8679 


1 


141 


NEHQPGQHGEMPSLLKIQKISWAWWSMP 
V I PATQE AEME KESQISIM 


8618 


22519 


A 


8680 


499 


140 


GVFFFSRRKRCGVSPPPPPKPVFFPLPG 
FFFWWGGCPSAP P P PGGGFP PNP PRGFF 
YPPPLKGNFFFPPPAGGGPPRGFFWAPP 
PPFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFb 


8619 


22520 


A 


8681 


60 


224 


KQTKY I KE ITKH I CT WQGRYHDHEGGFP 
RVKL IHCTPDMLTP VI SPDVGNS TTV 


8620 


22521 


A 


8682 


252 


383 


SQXKKKKKKKKKKKKKKKKKKKXXGGGL 
LKNFWGGPNLTGGGKI 


8621 


22522 


A 


8683 


405 


145 


KFFFSGGCFFFWGPPGKNVPPPKKMFFF 
LIPPPKKKKPPPPKKIFFPPPGVFFPPP 
PPFFFAPPPFFFFFFFFFFFFFFFFFFF 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQID 
NO; of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 

ZD 


Predicted 
beginning 
nucleotide 
location 
co r res pond 
mg to nr»i 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
currebpuii 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylaianine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K = Lysinej L—Leucine, M = IVIethionine 7 

— As>pd.r*igiiic, jr — Jrroiuie, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 

n 1 1 /> 1 i\ t \ i i c\ c± A a\a4-\ s\i\ \ ' — rirkCciKln 
UUUCULIUC UcIcLIUIl, \ — [JUsMUlC 

nucleotide insertion 














FFF 


8622 


22523 


A 


8684 


1 


406 


LYYFLTKSTTTYLAVPQPFPPTPSTPSR 
AKKKKKKKKKKKKKKKKKKKKKKKKKKK 

X\.X\.X\.X\. xVi\.X\-L3\J7 VJ IT XV_L\_X\_ A. X\. XYVJf O VJJ VJJ \£ X\_ X\ XV XVVJXn. 

GPEKNRGGGETTPGEKKKKKKKGGGGKK 
KKKKSGGGKKNGGKPGRGGGGGR 


8623 


22524 


A 


8685 


391 


265 


GRPFFFFLDGVSLLLPRLECSGVILAHC 
NTiCIjPGPSDS PS S A 


8624 


22525 


A 


8686 


2 


177 


DTAIiYSLYIFTTTQWGSLTHHINNIxCPS 

FTP VMTT tiVIP T "f-TT » ^ P TT iT .T , ^ T /WPP) T T TiTF? 

ss 








R£R9 


47 Q 

*r / y 


JUO 


GL.PKCWDYRREPPHPAANFCI FRRDOV^ 
PCWPGWSCL 


8626 


22527 


A 


8688 


35 


161 


SNP P S SAFQVAGTTGVRHHDQLTFF FFF 
FLGGTGQTLCRGGD 


8627 


22528 


A 


8689 


1 


122 


THH 1 NN I KP S FTRENTLMF I HLS P I LLL 
SLNPDIITGFSS 


ooZts 


22529 








152 


MT TT 7± TTT'TCT T fTT T7VDCTXT TOOTT r~"T TP 
ftlT TT.QT.I7T TST'T.TVDT'DQTrQnV 


8629 


22530 


A 


8691 


541 


85 


IFLRQFLSPRLEYSGVIKAHCSLNLLGS 
SDPPASAFKVAGTTGIRHHAWLIFVFFF 

2k 77 TrJ.7P7-TV"X7Zi nnnT * W T iT .MT.PTvTT .PATS WD Q 

AGI IGISHHDLLGIQITTWLLQKLSYLQ 
KKICKDQETGLLHAFHLHPYPYPMPSSP 
QLLQGPDILTNG 


8630 


92531 


A 


8692 


122 


JO 


GR VDHS FMP I VLEIj I P LK I FE I GP VF TL 


8631 


22532 


A 


8693 


276 


77 


GVS P CE PVFFF I LGRCNP PAWGVPKGRD 
FFVLSFFFVVFYIL.L.FFFFFFFFYFFL.F 

r i v uu _ x/ x v v L7 xx, XJX-JX' r l x r i i. r 1 1 r xjx 

SFYCFNGSNK 


8632 


22533 


A 


8694 


3 


70 


FMP I VLELI PLKI FEIGP VFTL 


oOjj 




A 






i 
j 


uOvJl xr OjrxxrvjjxxXT o JQJr i xxJ_iilix\.iJa W i-ji-ix\ lurlri. 

DTSNSVHLLELCLRSQRHGRARWLTPVI 
PALWEASAGGSRRQEIEPI 






A 


ooyo 


AH 1 




DDPT)PPPPr!lQT7PP TJ3DTJTDQT.C; DD^ , Q'P^^ 1 IVTTi , 
xr xr xr Jr xr ir ir L.OO ILXrl^irxrJrlxrtDXjo xr irLyO r\.Jii v ix? 

VLALSQESQMTFFF FQLRKKLHLYL 


oOjj 




A 


OOy / 




^ 1 ^ 


IljXVXvJ\X\.XN.XVX>.X\.XXiVX\.XVl^ 

KKKKKKKKKKGGGGPKKKKFEAIFGKTP 
GGVSQGGGPPPLNFFFLNPGPPFF 


ODjD 






OD70 




no 


KKQT T . F F F Q t? YTTFGV <>[ A K T "NTR T P ^ T 

£\.X\.V7 X XJ X Hi V i—> J—J OX XXX IZi^J V 0-rt.Iv X ~M -L\. X. Ja. O X* 

VS VNKVARCIGE I FTDAVQS AFQKE VGG 
VS DS MVHKYE YAG C VD VKAVC SHGL 


8637 


22538 


A 


8699 


2 


164 


FLVETGFLHDGQAGLKLLISGDPPASAS 
QSAE I TGMS PHACNS STLGGRGGRI T 




99^^Q 


A 






Oil 


ppnc!PWATnGG'prT.FT.TPTTpn'nT,anz\PP 
RKLALLEGS FQGQAQWLTP VI PALWEAE 
ASGS 


8639 


22540 


A 


8701 


2 


170 


ARAGLYHVGQAGLKPIxTSGDPIxASASQS 
VRITGVSHRTRPKIIFKLIFFLFFFFFF 


8640 


22541 


A 


8702 


31 


229 


MEKYjNVHPHSGILRSHEKEQAALjFTIAK 
RWRQPS YPS IDE W INIMWS SHTVEYYTA 
MKRNKQLYS Q 


8641 


22542 


A 


8703 


3 


126 


DQELRDPTLTFRLLGSPRPVVVETRPVD 
DPTAP SNLYIQE 


8642 


22543 


A 


8704 


438 


31 


S EPCAGYNLLVCRFIiS PLE ICHN IRVGVT 
RFSRCHLSPLSLTRKGNSLTPCTSRVRQ 
CI^LWI^AHSALHPLSGMHCIxALPSEm 
PVPQMEMQKSPIFCIAHAGSCTPELFLF 
GHLGSTPLTHLLSLRIiWFLRTSE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ED 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GJycine, H=Histidine, I=Jsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W==Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8643 


22544 


A 


8705 


429 


319 


PGAI I AHRS LNL PGSSDPPT S AS Q VAGN 
TGVSYCARP 


8644 


22545 


A 


8706 


426 


245 


RAATP S P I KLTKKKKKKKKKKKKKKKKK 
KKKKKKKKKKRGAP LKKTLGGPQ INRGK 
KKIFFFF 


8645 


22546 


A 


8707 


1 


108 


KHPRGGSFGYCFALAWVAFPIALVSGI I 
YIHLRKRE 


8646 


22547 


A 


8708 


3 


146 


Q I EGHT I S ALGDGAAWPVQGL I RHFRP E 
IiEE RMQR FAQ QHQ ARQAAS 


8647 


22548 


A 


8709 


2 


83 


LVLEIAVALIQAYVFTLLVSLYLHDNT 


8648 


22549 


A 


8710 


1 


81 


RPLS KT VR FNVLKVT KAAGT KKQ F QKF 


8649 


22550 


A 


8711 


156 


4 


RPLS FHPGGKKRLFLKKKKKKLCAVAHA 
CNP S TLGGRGGR I MRS GDRDH P G 


8650 


22551 


A 


8712 


440 


233 


TPSYFVLLVERGFSMLPRLVSIiTSGDPP 
ASAFRSAGITGMSHHAQPRTGRSKAYLP 
FLEEPDLKHIFPF 


8651 


22552 


A 


8713 


92 


302 


AMHPTMGPQDQGHTGGLYAHTPLHPHSH 
THANTLIHSHIYAHAHSHTIPATCPKGP 
TQPYLCLLSPHIEL 


8652 


22553 


A 


8714 


2 


166 


KKINSFFSQGLTQAKVHGGRPSSLGPQT 
PVLKRSSCLSFPSSWDYRHEPLYPAN 


8653 


22554 


A 


8715 


1 


129 


GFRHVGQADLELLTSGDLPASASQIVRI 
TGVSPCAQHEFFKSN 


8654 


22555 


A 


8716 


381 


105 


RRS LLHS VLNGAQAGVQWRDLGS LQP P P 
PSSLPWPPKOTRLQPLPGRHPVWEVRSV 
SARPPIVWDVRSPSAWLPSLESEERLCL 
AAIPSGK 


8655 


22556 


A 


8717 


385 


125 


FFFFLPPAFLLSGXFWFFFFYIFSFFFF 
FFFLGSPPPPIFFFFFFFLNKIFFFFFF 
FFFFFNNLFFFFFFFFFFFYFFFFFFFF 
FFF 


8656 


22557 


A 


8718 


409 


264 


KTDQRWSSTSSSKIMSQSQVSKGVDFES 
SEDDDDDPFMNPSSLRRNRR 


8657 


22558 


A 


8719 


3 


269 


KKHVKRVL S HLKAS CPEELLHFS S WPQL 
LELQLMGGGSHTHMDQHTHTHMHPHIYT 
HAHTHTWAKLNELP WVEMNLLS LGAFQ 
QRLP 


865S 


22559 


A 


8720 


430 


141 


L FF FLS FFEDGVLL CHPGWS WAQ S QLT 
ATSTS KKKKKKKAT P PGI PKFL I GKSGK 
PPRVLLIGHWAPFKFFFFNPRLAFF 


8659 


22560 


A 


8721 


386 


290 


RWGLTMLPRLVLNYWLQAILLLWPPKVS 
GLQA 


8660 


22561 


A 


8722 


3 


157 


HE VFX*VE TVFLH VGQAGIiELPTSGDLPT 
S ALWEAEWGGLLEDRS SGPAWET 


8661 


22562 


A 


8723 


493 


186 


FSPQGGEKRAILGLRULCPPGVKDFSAR 
PPKEVGMEGRVPKTKLFFVFLKKKGFPL 
IGRGGFKSRPWESPPRPPQKVGVQRLNP 
PPGPFFFFDLIWPNRNGS 


8662 


22563 


A 


8724 


3 


186 


GGGTPPFPGVWGKGNGGGTPPAKTLPPF 
PSPLFFFSPCPSPGEGGSREPLLSTPLA 
CICI 


8663 


22564 


A 


8725 


370 


67 


FAPPKNFYKGAPPIFFIFLFFILFIYFF 
FFFFIFFKFLVFLLETGFHHVSQEDLDL 
V I HL PQ P P KWGL QAWAT TPGQ F F VFL I 
ETGLHPVNQDGHNLLTL 


8664 


22565 


A 


8726 


966 


652 


LGSLKPPPTGFKRFSCLSLPSSWNYRHA 
P P CPANFVFLEETGFLHVGQVS LELLTS 
GDPPTLASQSAGITGMSHRDWLGQHLIV 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutam ic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine ? 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














mCYLFYQGHIKTTRSLLPPP 


8665 


22566 


A 


8727 


85 


425 


HMHS FAHE LHT F LHTHTNS HTNS QNQ S P 
TYTSKSHTHS YMRFTDSHDS QMQQTHRH 
S LLHMTQL YTCTHTHTHTRT VI PLHLHS 
VKGWLNT KVALGGRT S HGRE SHI AGRLL> 
A 


8666 


22567 


A 


8728 


393 


124 


LGSGEPQLFQSPPSGGPPGPKSHWNSWG 
NANPPVPPYIIESAPPPPHFPFGAQSKY 
SGGQAARLCPCPPFFLKKKKKKAPGSGT 
GTFSFQ 


8667 


22568 


A 


8729 


25 


217 


AVQ P I RVQWRR SLQSPQEQII LAP S LAK 
VDMEMTQLTQENADFATRDRYHHSSLVN 
REQLMPHY 


8668 


22569 


A 


8730 


1 


611 


PGIFYSALLSLDTS ILNQLCFIMHRYRK 
NLTAAKKNELVQKTKSEFNFSSKTYQEF 
NHYLTS MVGCLWTS KP FGKG IYIDPEIL 
E KTGVAE YKNSLNWHHPS FLS YAVS FL 
XiQTSSSSSSGSSSSSSSSSSSSSSSSSS 
SSSSSPSPEERTVNVSSIRGKKWSWYLD 
YLFSQGLQGLKLFIRSSVHHSSIPRAEG 


8669 


22570 


A 


8731 


120 


378 


VEKTQAS ILGCKCSTPRAALGFRERTLP 

T DDPDPPr 1 D'DTT I 7\ f^rv/TTPT "CITO 7\ T r* O C\7\ OTaTT 
J_iK.r<.^ .f ^ *w vj ir f LYlJi J_i rL i\±<J\Li\j 0 v™J " ->-» 

MPVFPALWEAEVGESLEARSFETSLGNI 
A 


8670 


22571 


A 


8732 


2 


68 


VYVCVCVCVCVCMCVCRAKVGM 


8671 


22572 


A 


8733 
.... , 


382 


258 


NYHSSLHLiETPGLKQSSCLSLPKSWDYR 

T_T"C OTl f~* Tl 7\ T T 'C'NTC 1 T 

HiL b P C P Ab I h rJS li 


OD 12 


225/ 3 




8734 


479 


326 


PPLYGFFFFFFFWGGGFFFFFFFYFFFF 
FFFFFFFFFFFFFFFFFFFQTTF 


8673 


22574 


A 


8735 


422 


2 


GRRPPGLYFFFFPPGKKNFQGRGGFLFF 
FPPKGFFFFFFPIGFFFFSSELGKDWPP 
PKKKVFSQRFPFFFFPPPLFLFLFFFFF 
FFFFFFFFFFFFFFFS FFFFFFFFFFFF 
FFFFFFFNKQVFIERLLCARHCFRPPAS 


OD /4 


225 fj 




8736 


7 


75 


JLAVALJLQAx vr lliliVibljYIjHDNT 


8675 


22576 


A 


8737 


2 


176 


NFGLLAETGFLYVGHAGLEVRSSGDDPA 
SAS QRAGI TGVS TAPGFNRYF YKQTI YK 
YG 


8676 


22577 


A 


8738 


408 


111 


GSSPPPRAGGENFLKKNAGGKNFPGGEG 
GGGFYPLYPKKFFFS PKGFI FWGGGGGK 

WPPGGNWGPPKNF 


OO / / 


22 j Jo 


7\ 

r\ 


5 /39 


0 


249 


VT?PT X(»"T/"T XJIVfO ACD I?nDPLm^ 7~KVLJT, r VVZi r TfV 

Y Jii^JjrLJlijnLYiyAis Pi 1 yKGrii^ VJMHKivi\.K.KK 

KKKKKKKKKKKKKKKKKKKKKKKKKKKK 

lLKj-J-iL±jKjK^rJS.r lor jtSAjIi AA^XV. vr J. 


8678 


22579 


A 


8740 


1 


226 


FFLRQSLSVTQAGVQWCDLSSLQLLAFQ 

Q>TQpnic;Tv QO\7ariTnr , r , T7TTTJUTr , iT.TTrTC 
vjoolNor 1 DfiayVHul 1 Vrlrlrl-LvjJLi i. r _Lo 

LVETGFHHVAQAGLELNLQ 


8679 


22580 


A 


8741 


4 


96 


DGLIMLPRLVSNSWPQGILPPWPSKMLG 
LQV 


8680 


22581 


A 


8742 


19 


112 


FGRILLVREKKKKKKKKKKKKKKKKKKK 
GGGPL 


8681 


22582 


A 


8743 


362 


234 


KMDQKCLSEVLQRWFPCCCCCCCCCCCC 
CCFSLPFIFEPSRS 


8682 


22583 


A 


8744 


3 


419 


LTLRRFQLNLTALTKKKKKKKKNLKKKK 
KKKKKKKKKKKKGGPLFKNSLGGPIFPG 
GGKFIFFFFWGGFLKPPGDFLKKTFFLG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D^Aspartic Acid, 
E=GIutamic Acid, F— Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GEIFGPPPPKNLTPLGKKKIFKGVGGKN 
P PL F LRRKKF F S GG F FKK I F S P GLG F 


8683 


22584 


A 


8745 


444 


1 


KKPSKE I LYPENSRFFS PLSPLKFFFS P 
KGFNFWGGGGPLCPPPKERFFSKNPPGG 
FFSPPLKEKIFFFPPPVNLGPPRDLFKG 
APPFFFFFFFFFLWEMFYWLSLSSGSQR 
LPQQVPTVEPSELGGMGAYFVSKRSTYL 
QRGWRPER 


8684 


22585 


A 


8746 


104 


222 


NKTFCLLKKKKKKKKKKKKKKKKKKKNK 
KKKKKNWGGAL 


8685 


22586 


A 


8747 


403 


201 


MEKYNWQPHSGILYSHEKEQAALFPIA 
KRWRQPSCLSIDEWRNIMWSSLTVEYYT 
AMKRNKQLYSQ 


8686 


22587 


A 


8748 


1 


90 


RTRGAVYAALERMGLDGCVEDLRSRLQR 
GP 


8687 


22588 


A 


8749 


444 


85 


SPPPPGLFFFFFFPKKKTSPPPTKKGFF 
SPPPPQKFFFFLKPFFFLGGLGPNFPPP 
KKNFFS KNPPRFFFFPPFKKKNFFFPPP 
FFFAPPKFFFLTPPPLFFFFFFFFFFFS 
FFFFFFFL 


8688 


22589 


A 


8750 


2 


309 


EFFPPYWEFLKINACMFSPEKKKKKKKK 
KKKKKKKKKRGP PLKKTPGGPQ I PRGGK 
KKIPPIiKGGQKKPPRGFLEKNPLFGGGP 
FGPPPPPKINPPEKKKNF 


8689 


22590 


A 


8751 


3 


76 


TASVSEGGGLQGITMKDSDEEEEG 


8690 


22591 


A 


8752 


433 


145 


SFFFFFFFFFSFFFFGKKSSFFTPL 


8691 


22592 


A 


8753 


2 


124 


GHLLMHIilGSATLAISTINLPSTLIIFT 
I L I KKKKKGRP F 


8692 


22593 


A 


8754 


80 


201 


F I NANS KKKKKKKKKKKKKKKKKKKKKK 
KKKKGGGLLKKTPGGAQF 


8693 


22594 


A 


8755 


415 


1 


IYFPTPEKFGPPKETLKKGAPFFFFFKQ 
KFPFFGPGGKQKGGFGSLQTLPPGKKKI 
SRPNPPRKRGFKDAPPPPGKFVFLKKKG 
VFPGGQGGFKTPTPRDLAPPAPLNPGAL 
FFFFLRQSLSLSLRLECSGTILAHAS 


8694 


22595 


A 


8756 


3 


136 


FTATRPGVYYGQ CSE I CGANHS FMP I VL 
ELIPLKI FEIGPVFTL. 


8695 


22596 


A 


8757 


80 


326 


KKFSLGSQGGRAGVFPRPPLGKLQTTAL 
NSGAKGTPPFLFPEPRKRGGPPPAPGWE 
GIFCPHFFPMGQKGEGGLWHTGEGPH 


8696 


22597 


A 


8758 


1 


164 


SLISSTQGHKQCRRPQGPLPRKTRDPCS 
HVYLLTFPPLLSYDPAKSPSLRNTQE 


8697 


22598 


A 


8759 


431 


79 


FFFFFIFLKKVSTQSPPPQFYFFFFLKT 
FFLFQPLLKEGGSWGGRGDKPPPLAPFK 
KWGVFFFFFSFFCCFVFFFFPLFFFFFL 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFPRCVLTRL 


8698 


22599 


A 


8760 


60 


194 


AR PTC PATAVTQ KKKKKKKKKKKKKKKK 
KKiCKKKGGGVLKKKQGGRKY 


8699 


22600 


A 


8761 


383 


38 


KKKK 


8700 


22601 


A 


8762 


399 


164 


HITVKSLLVPMDDPPKKKKKKKKKKKKK 
KKKKGGGFKKNLWGGQKLTGEKKKIFFF 
LKGGKKKPLGIF 


8701 


22602 


A 


8763 


353 


48 


NFFFKVFFFLKDFSHKGAPPKKGGPPKK 
TPPRKKFFFLKIKPLFFIAFFFFWSLFG 
FFLIFSLFFFFFFFFFFFFFFFFFFFFF 
FFIVFTITLMHFKIYLH 
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M 
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USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 
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correspond 

ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A—Alanine 
C=Cy stein e, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8702 


22603 


A 


8764 


2 


93 


FAMLPGLVSNSWPEAICLPWPPKLLGLQ 
MM 


8703 


22604 


A 


8765 


398 


63 


PPPPVFFFFSGGFFFWGGGGQKSPPRER 
CFFFKMHPGFFYIPPFVGKKVFFVLEGV 
VCPLGFFLSGGAFFFFFFFFFFFFFFFF 

TP "OTP TP TP TP l'""' TT1TTI TP IT" TP TP T7» TPT7 1 TP TT>TP/*~« DHOT TP TTT 

FFFFFFGFFFFFFFFFFFFGFPSVSHL 


8704 


22605 


A 


8766 


3 


83 


ILE I AGAL I QAY VFTLLVS L YLHDNT 


8705 


22606 


A 


8767 


450 


306 


Q KKKKKRRNKPE GLGT VAHACNP S TLGG 
RGRWITQPGQQEGNFISKY 


8706 


22607 


A 


8768 


1 


111 


WSLALVAQAGAPGFKQFS CLGLPKS WDY 
RHEPPCPGL 


8707 


22608 


A 


8769 


385 


261 


FLAEMG FCRVGQAG FKL LNS GD P PAS AS 
QSAGITGVIAPVL 


8708 


22609 


A 


8770 


3 


293 


SLGS KPLGLLSLS PVKWFR I FGKERNKV 
WGKDGGTDRNQS S SAFKHLQRGDSDPKQ 
NKI KACS S KF YLRRCVKRS FLLiI KKKKK 
KKKKKKKKKKKKKKK 


8709 


22610 


A 


8771 


2 


281 


C VCVC VCARVY I YIVYVY I C I YTYMCVY 
MYICIYIHVYIHVYTCVYVYIRIHMYIC 
ICVCIYTCIYTYIYTHTYKHSVYYLHNF 
YINPKLLQS 


8710 


22611 


A 


8772 


295 


161 


THTHTHTHTHTHTHTHTHTHTTCSLPPS 
LiAQLLGNCC I KAAI S IL Y I L 


8711 


22612 


A 


8773 


1 


318 


F F F F VRW S FT L VAQAR VQ WRDLGS LQ P L 
PPGFKQFSCLSLLSSWDFRHTPPCnANF 
VFLVE TG FLHVG QAGLE LP T S GDL P ALA 
SRGAGITGVSHHPQPPLCFLFL 


8712 


22613 


A 


8774 


3 


70 


FMPIVLELIPLKIFEIGPVFTL 


8713 


22614 


A 


8775 


369 


191 


TYKINKrGWAWWCAPLVPAAWKAEVEGL 
LEPGSLRPASVIQQDPHLKKGTYSRVLT 
THL 


8714 


22615 


A 


8776 


2 


124 


GFRHVGRAGLEFLTSGDPLASAFQSAGI 
TGMNHHVQP I VE 


8715 


22616 


A 


8777 


1 


162 


LKYYTADENGKTSRLLPQRPSDECGAGV 
FMASHFDRHYCGKCCLTYCFIKPEDQ 


8716 


22617 


A 


8778 


346 


138 


YNSPPYKEKTIPLQARVNFGPPRDSLKR 
PPFFFFFFKRQSLTMLPRLVLNSWAHGI 
LLPWPPKVQVLQA 


8717 


22618 


A 


8779 


2 


117 


LNLDTTAVQVRNYPRIRES YKVS FLSAL 
EEYTKKLNTQ 


8718 


22619 


A 


8780 


2 


97 


WDLTMLPRLVSNSWAQVILPSWPPKVLG 
LLA 


8719 


22620 


A 


8781 


403 


276 


I ISTVFQRGSCPI PSRKE VCSERPRRLQ 
KDSLTGFPHTVYKH 


8720 


22621 


A 


8782 


2 


119 


QAGLELLTSSDLPASATQSAGITGMSQR 
ARPHFQCKNVP 


8721 


22622 


A 


8783 


245 


101 


ELQFKMRS GWRRSQ P I SWGLTLVPRL I S 
NSWAQVILLTWPPKVLGIQA 


8722 


22623 


A 


8784 


2 


124 


AKL E KKKKKTGRAKRRMQ YNRR F VNWP 
J.}* GJxJxiS-tjPiN AIM o 


8723 


22624 


A 


8785 


3 


355 


VGP SEP GAG YNLL VCRLLR PLE KHGI R V 
GVTGFSRCHLSWLPLARKGNSPTPCASW 
VRHELALLRLTVCGLHPLSCTHCLTSPS 
EMNPVPQLEMQKS PVFCVAHAGS CRLEL 
FLFGH 


8724 


22625 


A 


8786 


238 


9 


PVKLGAGP PQAP FWLRKWAPLI KPFFKG 
PIGSLQIPLFFFLRQSLTLPPRLECSAT 
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M 
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nucleotide 
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ing to first 
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Predict- 
ed end 
nucle- 
otide 
location 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PnenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possibie 
nucleotide deletion, \=possible 
nucleotide insertion 














WLTESSTSWVHAILLPQPPE 


8725 


22626 


A 


8787 


336 


10 


CLSLPSSWDHRRLPPHPASFCIS 


8726 


22627 


A 


8788 


1 


147 


IFYLLETGFHYVGQAGLELLTSGDSPAS 
ASQSAGITGVNHHALPEIHVF 


8727 


22628 


A 


8789 


3 


130 


DLEEGIQTLMGRLEDGSRRTGQILKLDH 
SSEFSKTRELYPVF 


8728 


22629 


A 


8790 


206 


3 


VPPSGP I KKGDKKKNLYLI FL I KNVPNK 
KLRKGVS PLAI P FFFFFFVFLLETGFHH 

VGQAGLELLTS 


8729 


22630 


A 


8791 


124 


390 


RAGAQSNLMEPPPSGVKAIFFPNFPKKW 
GNGT P P PAP L I FGGL KKKG VS P CGP GGS 
EPPAPGGPSPLAPPKGGNNGRDRLPPPP 
DKFLN 


8730 


22631 


A 


8793 


2 


301 


HE ERERE RERE RE RE RE RERE RE RE RE R 
ERERERDALFAFFPPRGARPPPEIERGV 
F VGAGS VRE IYPTLSIFFFSP P AHRGRE 
LRSRCVSHSLHLISRA 


8731 


22632 


A 


8794 


1 


285 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERGAPPP 
PLFFLGRGGALRAHAISPPPFLGAPPYI 
YIFFHMRARPPHRYLCAQRETRPAL 


8732 


22633 


A 


8795 


I 


321 


ARGERERERERERERERERERERERVSR 
RIPRGDLRDQFPHPLARSRDSFCGSADH 
FGRGVFNKVS WTD P PTHRVTS S LGGGV 
ERDLLTLSGGGTYAPWKNMCATGEDQ 


8733 


22634 


A 


8796 


2 


140 


HEERERERERERERERERERERERERER 
ERERE RERERE RE SLPPPAGCAKGAGWG 
E 


8734 


22635 


A 


8797 


2 


223 


SARERERERERERERERERERERERVSR 
RNLSRGGGGVPPPLQNVRIHSGGPARGT 
LHHI KKKTSLTDVGLAQ 


8735 


22636 


A 


8798 


1 


176 


ARGEREREREREREREREREREREKISF 
LGGGRGTQFAIERGVIQFCGRRYTTGRV 
RN 


8736 


22637 


A 


8799 


335 


27 


PPTKFFFFFFLVEMGFCCIAQGGLELLS 
SSSLPASAYQSAKI IGVSHHAWS VF 


8737 


22638 


A 


8800 


342 


191 


STSLSLPKCWDYRREPPCPANLSYFFKD 
P FS KCNPRLRYWGLGLQHRNFG 


8738 


22639 


A 


8801 


104 


332 


AVPLTMVKIQPLWKRVWRFHMKSKLELP 
CDPAISLLSMYPKEMKSLCQRDVCTPRL 
STGPLTIAGMWNPPKRSSMD 


8739 


22640 


A 


8802 


563 


258 


F FLNLETRS W YVAQARVQWLFTGTVTVH 
YSLDLLAS SHPP I S ASGVAG I TDACCAQ 
LTLPFIiKLGLLGLGGGGVGYPSYFGIiEG 
FHNDCSRVRVLCMWSYQ 


8740 


22641 


A 


8803 


2 


199 


ARGLLVRRFLS PS E KRS IRVGVTRFSRC 
RPSPLSLTRKGNSLTPCASQVRQCLALL 
RLAHGACTH 


8741 


22642 


A 


8804 


266 


108 


KLWSKCVALPPSFSSSCSSHIGCACFP 
FTFHYD CKFPQAS QAMtiPEQAVE P 


8742 


22643 


A 


8805 


1 


140 


KWWFKRPGVY YGQCS E I CGANHS FMP I 
VTjELIPLKIFEIGPVFTL • 


8743 


22644 


A 


8806 


2 


90 


S SAAAEE3STNDE KKEEAE KTEDDME FGLF 
D 


8744 


22645 


A 


8807 


229 


34 


KKKIFFFFPGFFWPPQKFFFKAPPPFFF 
FFFFFFFFFFFFFFXjDIjFIYLI ils ytk 
LISIRAAAPI 


8745 


22646 


A 


8808 


43 


173 


NADSGHAQWLTPGMPALWEAEAGGSQDQ 



1081 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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Predict- 
ed end 
nucle- 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline*, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *-Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EIETILAGAGKPRLY 


8746 


22647 


A 


8809 


259 


134 


IYMIFKNKFFNRDRGLPMLHRLVLNWAQ 
VILLSWPPKVLGL 


8747 


22648 


A 


8810 


391 


193 


VLPKIPGIQFPPPCRFKKRPRPRFKKPP 
PKRKKISFSNPPKIWPPQGYFKRGPPRL 
FYFFFFWIL 


8748 


22649 


A 


8811 


304 


3 


TTPPTNIFFCFLFFIFIFLWYFIFIFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
Q CF I GHDD QKMM VHTHT QTS SHIjLQGGS 
NPQGKGQS PGWVLWCA 


8749 


22650 


A 


8812 


395 


268 


FLVETGFHHVGQAGLELPASSDLPALTS 
QSAGITGVSHHGWS 


8750 


22651 


A 


8813 


782 


1174 


LSRLFYFCVLFCLYMKTTQLPYFRGLVC 
LFVLRQGLTLSSRIiEGSGMIAACCSLNV 
PGS TDS PTSASR WRTTGVRHHTQL I FV 
CFVEMWFHYVAQAGLEPLGS SS P PALS S 
QGAGITDVSHHTPLELjCF 


8751 


22652 


A 


8814 


95 


236 


ATMPGIiKNIYFLKNKDKGLTMLPSLVLK 
SWARVILLPWPPKVLGLQT 


8752 


22653 


A 


8815 


381 


58 


S CLLF p p fwgargggppraggsnppgp p 
GLiTPFPPKKPKNYWARGGGPFI pppreg 

waggfflprrgrvplapgsnnlelgplp 
s ppgyqkktpfs kkkkkekkkrkr 


8753 


22654 


A 


8817 


441 


158 


fffflrtdgyltmlprlvsnswpqaill 
pqppkmlgl 


8754 


22655 


A 


8818 


415 


342 


RLVLDSWAQEILLPWPPKVLGLQV 


8755 


22656 


A 


8819 


400 


171 


NFFFPPGVKFLGGGGPQFPPPQKRGFFQ 
KTPGGFFKPPPKKKKKFFPPPPKIGPPQ 
GIFKKAPPFFFFFFFFFFFF 


8756 


22657 


A 


8820 


2 


268 


INIIIAFTISLLGILVYRSHLMSSLLCL 
EGI ILSLFI IATLITLNTHSLIiANIVP I 
AILVFAACEKKKKGGAVLKDPWGGQSLR 
VLAR 


8757 


22658 


A 


8821 


2 


258 


LCLPNQAWAMAGSPPPASLLPCSLISDC 
CASNQRDS VGVGP S E PGVGYS LWRRFL 
SPSEKRSIRVGVTRFSSWVRWLRTVIPA 
T 


8758 


22659 


A 


8822 


250 


456 


GGGDKFGLIETFPPGLKPFFLLNLLSGW 
ELGPLAPPPFKFCFFKGRGFPFLPRFVF 
VANLLLTCCKKRD 


8759 


22660 


A 


8823 


492 


376 


QENCLNPGGRICSELRSCLCTPAWATER 
ACLKKQNKTNQ 


8760 


22661 


A 


8824 


1 


116 


LTPLPPSAPPSVDDNLKTPPEWVCSLPF 
HPQRMIISRN 


8761 


22662 


A 


8825 


2 


191 


FLVETGFHHFGRAGIiKLLNSGGALTSAF 
QRAGITGVITGVSHCFEVNRVWTGTQYF 
LCYILKS 


8762 


22663 


A 


8826 


393 


75 


PRFFPPPPPKIFFFPPPPFFFLGGFPPI 
PPPPKNFFFPRPPPGFFFPPPLKKKIFF 
PPPPFFSPPPNFFFPPPPPFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFS I 


8763 


22664 


A 


8827 


81 


380 


KI ITKHNIARTNPYTFCIIVINYLKKKKKK 
KKKKKKKKKKKKKKKKKKKKKGGGPFLK 
NPGGGQNNPGGKKNFFFFFGGGKKWPPG 
FFKKKPFFGGGKFGAPPPPKN 


8764 


22665 


A 


8828 


354 


13 


TKPKTPPLTKGPRRGFLAPRVPGGGGP I 
TP P PQNFLGEKGTLARKKVFP PGRPAPK 
RP PRKKNGFEMRPLKKNGPGPWKFLGFF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HNHistidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IANPPGFGPKKKKPENQTHKKKCQRRKP 
RF 


8765 


22666 


A 


8829 


1 


215 


KHSTTHSHCPRTIKLKKKKKKKKKKKKK 
KKKKKKKKKKKKKKGGPPFKKTLGGPPF 
SPGGKKKFFFFLGGL 


8766 


22667 


A 


8830 


1 


127 


ILFFWQRRGLALLHRLVSNSWAQVILPP 
RPPRVLGLQVIYFK 


8767 


22668 


A 


8831 


264 


55 


FFFXFFXFFFFFFFFFLFVFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFWCS CS 
YVLIFIQPFIKSFH 


8768 


22669 


A 


8832 


399 


85 


PPQKNFFFPQAPKISGGGGPQIAPPQKK 
GS FQKTPGGFF I P PQKKKKNYFPPPGKM 
GPPQGFFKRPPPLFFFFFFFFFFFFFFH 
NRHLFKVKHLQNS FHLVQLRL 


8769 


22670 


A 


8833 


47 


288 


NCLYRTKKKKKKKKKKPPYKKKTLRGPN 
FPPAGPPAPLPLSGGEKKPSRGLLRRPP 
TLGGAAKGP P P P P KLTPLRKKKI F 


8770 


22671 


A 


8834 


1 


146 


IN 1 1 LAFT I S LLG 1 LVYRSHL I S SLLCL 
EGI I LS LFI I ATL I TPSTLC 


8771 


22672 


A 


8835 


2 


79 


EDPKTSPKPRI IQTRRPGLPPSVSN 


8772 


22673 


A 


8836 


1 


156 


PTR P T 1 1 TP I LLTIi FIj I TQ L K I USTTN YH 
LPPSPKPIKKKKKKKKKKKKGGGP 


8773 


22674 


A 


8837 


152 


292 


AKTFNF YKVEFINVFFNGLCLIjI I KNTL 
PKKKKKKKKKKKKKKKKKKK 


8774 


22675 


A 


8838 


1 


122 


VASNS IHPKAKENTAPHTHTHTHTHTHT 
HTHTHTHIj FMA. I FK 


8775 


22676 


A 


8839 


2 


106 


HVGQAGLELLTSGDLPTSASQSVGITGV 
SHRAWP 


8776 


22677 


A 


8840 


364 


224 


APLCGRQICDCDCTYPSPHTYIHTHTHT 
HTHTHTHTHS 


8777 


22678 


A 


8841 


266 


1 


NSLSVEFLILFPPYNYSPKQCICSHQNC 
LNFWVIKCVNTPHPNFLNLRMPLiLAKRG 
AHACDPNTFGGGGRWITRSVDRDHPGQH 
NETP 


8778 


22679 


A 


8842 


343 


52 


KTGFRHVGQPGELIiASSDLPASASQSAG 
ITGVSHRVRPGLH I LDNSS FLDTSFAD I 
FSVLYLKAGIASLLHIYIHFLLPLRDLL 
LVSSLTVSFPTAV 


8779 


22680 


A 


8843 


256 


35 


IiFIFSLSLQYIFCVCGCVCVFMSLCFLV 
CAWGHVQLHVCS CVRMCVCENDVCVCGW 
VQMCFCIKNFQKEVYQI 


8780 


22681 


A 


8844 


333 


11 


PPIKKKAPPPKGRAFFFFFLKKKKGGPP 
PPQKKTGGGGPKKRGGVKKPPPGFFPGF 
FGPLFFWGPPFLPPFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFNFISTKNLF 


8781 


22682 


A 


8845 


15 


288 


RCGLTVLPMLVSDCWAQVILLAQPPRVL 
GLQAGASASTPITSVNCSQGAIYPEVGT 
TGSTI I AAAS SGGEFDSVLQND I CMCFL 
TQQFHF 


8782 


22683 


A 


8846 


1 


397 


ECAHHTQIiI FLFL I E SSLHHVGQAGLKL 
P I S S D P P T S AS Q S AGTTA V S HHA 


8783 


22684 


A 


8847 


1 


179 


ARERERERERERERERERERERERERER 
EREREREREREREREREREREKKKLGGG 
GGGPQFPIGGGLTQGGGGG 


8784 


22685 


A 


8848 


2 


233 


HEERERERDRERQRERERERERDRERER 
ERETFFSSSGGGPKETTLFFETGAPKEG 
YQNMHARLI PSAVPEKKRGRG 


8785 


22686 


A 


8849 


2 


180 


HEERERERERERERERERERERERERER 
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Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G]ycine, H=Histidine, I=Iso leucine, 
KNLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y— Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














ERERERERERERERERDFVFYRWGEGHI 
DRQYCSFLCPGWCHCPM 


8786 


22687 


A 


8850 


2 


79 


PEERERERERERERERERERERERERER 
ERERE 


8787 


22688 


A 


8851 


1 


178 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERARLFARGEK 
KKHTGGGPPLFYYLYI 


8788 


22689 


A 


8852 


3 


76 


TRRERERERERERERERERERERERESP 
A 


8789 


22690 


A 


8853 


5 


227 


E ERE RE RERE RE RE RERE RE RE RERERE 
RERDFFFFFLEKKYL.FSPIGELKGRGGG 
SLTPLLTGKWNPCPPPPH 


8790 


22691 


A 


8854 


1 


189 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RHS RL F P RE S VC VARAGG AS S P PRG 


8791 


22692 


A 


8855 


2 


271 


HEERERERERERERERERERERERERER 
EREREEPPRGEKYCIRVPPQKILSPACD 
SLLYVTHNLCVSHSTPRGERYVALFNSR 
ETRCF 


8792 


22693 


A 


8856 


2 


309 


HEQQQQQQQLRNLRDFLLVYNRMTELCF 
QRCVP S LHHRALDAE EE ACLHS CAGKL I 
HSNHRLMAAYVQLMPALVQRRIANYEAA 
S AVPRVAAEQ PGVS PS GS 


8793 


22694 


A 


8857 


44 


203 


G YS TC VGMHAHTH I HTDTHTHAHT Y IHQ 
CLIQIGLCTFATCTSQVNKSSMAEL 


8794 


22695 


A 


8858 


329 


182 


I IFVCLVERGLHYVGQAGPELKASDDLP 
ALASQSAGTSGVSHCVSSKMS 


8795 


22696 


A 


8859 


1 


175 


GLTLPPRLGCHATITAHYSLNLPGSSNP 
PTSAS QVAGTTHSRP CPE I TQTALQRGP 
HD 


8796 


22697 


A 


8860 


3 


350 


HEAIGLRSNIKNGLDHFLPLGTPTPLIP 
ILAMI ETILLLVQP I ALA VRL TANI TAG 
HLLMRLNGSATLAISTISLPSTLIIFTI 
LILLT I LE I AGAL I QAYVFTLLVGLYLR 
DNA 


8797 


22698 


A 


8861 


139 


3 


TFKEAD I KGGGPPHLAS FC I FGRDGVS P 
CWPGWSRTPDLKRSTRA 


8798 


22699 


A 


8862 


3 


134 


SPTPPPSSKPSSIPRKSSVDLNQVSMLS 
PAALSPASSSQRHES 


8799 


22700 


A 


8863 


346 


219 


RLVP P CLDNFLNF CRYS CLTML PGLVSN 
SWAQVIFLPQPPKD 


8800 


22701 


A 


8864 


1 


147 


GTRFFFERGIDHVGQAGIQLLTLGDPPT 
PASQRAE I TGVSHHTWPNF I F 


8801 


22702 


A 


8865 


267 


132 


KKKFWPFFFFFLRDGVSLLLRLVSNCWA 
QEILLPWPPKLLGLQV 


8802 


22703 


A 


8866 


343 


215 


LAQGLRTPI PKRVPPHTQKKKFCSERPT 
RIQTDRSCWSPLS 


8803 


22704 


A 


8867 


1 


242 


GTSKKIDKLDYIELKCFCTAKETISRVK 
RPVE WEKI FANS S PE KGL I S R INKE AKK 
LNSSAIAAHNCNPSTLGGQVRRIT 


8804 


22705 


A 


8868 


1 


219 


GTRRP T WAT WQNP AS TKKKKNS F F FETE 
SRTVPQAGGQGGNLGSLQSLPPGLKRFS 
GLiSMPGAPKGQSGSLMG 


8805 


22706 


A 


8869 


1 


181 


ATGG I LLLLD WS LAYE S KHLLEGAKSE 
SAEELKKRAQELEGKLNFLTKIHEMLQP 
GQDQ 


8806 


22707 


A 


8870 


1 


255 


AAAPPNAPGGPPGPQPAPSAAAPPPPAH 
ALGGMDAELi I DEE ALTS LELELGLHRVR 
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SEQ ID 
NO: of 
nucleotide 
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SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
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09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=G!ycine, H=Histidine, I=Iso leucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ELPELFLGQSEFDCFSDLGSAPPAGSVS 
C 


8807 


22708 


A 


8871 


383 


233 


ATTFPNPGFLFVRLFVLVQWPDHEPPGF 
KQSSHFSLPSSWDYRHPPPPPG 


8808 


22709 


A 


8872 


211 


294 


AXKKKKKKKKKKKKKKKKKKKKNI IWGG 


8809 


22710 


A 


8873 


398 


100 


QKERPPGFPPFKPGPPQKRGGKGAPPKD 
GEH Y F FW VLE KS TG F P PKTKRVY YPS P V 
YPHPPAPQKEGFPGGSPPPRPIASPFGG 
KKKEKKSYIEKENLF 


8810 


22711 


A 


8874 


3 


214 


DAWVLGILVYRSHLISSLLCLEGIILSL 
FIIATLITLNTHSLLEKKKKKKKKKKKK 
KGGAFKKNPGGAQ I 


8811 


22712 


A 


8875 


2 


254 


PRVRPEGRNRTPQSWFQANPMACMTFSK 
KKKKKKKKKKKKKKKKKKKRGGGFIKKM 
GGGKNFREKEKKNFLLIRGKI KKTGGDF 


8812 


22713 


A 


8876 


2 


135 


TINLP S TL. 1 1 FT I L I LiLT I LE I AVAL I Q 
AYVFTLLVSLYLHDNT 


8813 


22714 


A 


8877 


2 


296 


IiARGAEVTiGYGSHS RGRVPGALVGQGAG 
RLFTEHPGSSPATLAIYPTPEGTSVAVS 
ISAPPKARSRPYPPSRSCHNLSLAGSSV 
LiVPPGPVHRWVGRP 


8814 


22715 


A 


8878 


164 


29 


RKIGPARWLTPVIPALWETEAGGSRGQE 
I E I I LANT VKPGACE VL 


8815 


22716 


A 


8879 


381 


219 


GEKMTEEEVHMLVAGHEDSNGCINYEGK 
R 


8816 


22717 


A 


8880 


2 


156 


ANGNS FATRLSNI FVIGNGNKP W I SLPR 
GKGIRLTVAEERDKRLATKQSSG 


8817 


22718 


A 


8881 


3 


402 


SGFNIEYAAGPFALFFIAEYTNIIIINT 
LTTT I FLGTT YDALS PEL YTT YFKKKKK 
KGGRPFFFFFFLKKKKIPPPKKKNFGKK 
GETLKGRGGAP I FQPKKILS PPKKKKKK 
RGGAFKKS LGGAKFNGGGRN 


8818 


22719 


A 


8882 


1 


406 


RYSTPSEGEVGERYSTPPGETLERYSTP 
PGETLERYSTPPGETLERYSTPPGETLE 
RYSTPPGETLERYSTPPGEALERYSIPT 
GGPNPTGTFKTYPSKKKKGGGRFKEPLG 
G P RFTGVGRVKVFS LWGGVLNAR 


8819 


22720 


A 


8883 


484 


307 


KKILNRRVRWLTPVIPTLWETEAGSSRG 
QEME T I LGNTVKP PAS ASQ S AE I KGMR I 
IIF 


8820 


22721 


A 


8884 


1 


131 


GS LTHH I NN I KP S FTRENTLM VI HLS P I 
VLLSLNPDI ITGFSS 


8821 


22722 


A 


8885 


406 


106 


KGGAGPKIAPPKKKAFSPIPPPFFWPPP 

T7T~i T\T\T\T~\ (~* T1 T?TvTTrl WO T*l TIT T T^ITH '[ H l"> /"< T>T Tin 

VFLKGPPPPSPFJNFFPPP V i> FFRGPLFF 
FFFFFFFCETWLPFRPSNLVFTLPHVF 
AWLI P IDQYLF I SHS C 


8822 


22723 


A 


8886 


425 


325 


SSTHTHTHTHTHTHTERSHLLAFEPSAG 
ELiAIMAJyi 


8823 


22724 


A 


8887 


394 


278 


VCVCVCMCVYICIYPCWLPCWHMCLELY 
ivvr jxw Jl l v xjS.\j v 


8824 


22725 


A 


8888 


2 


86 


YSRTRAAALIQAYVFTLLVSLYLHDSAY 


8825 


22726 


A 


8889 


400 


62 


FLYLKSFPPPPERGVFSPLPPQKFFFSP 
KALNFLGGGG PKFP P P KKRFFFK YP P RV 
FLYPPHKKKNYIFPPPDDFGPPRVFFKA 
PPPHFFFFFFFFFFFFFFFFFTRLEWVQ 


8826 


22727 


A 


8890 


3 


100 


SLEPGTSGLKGFSCLTLPSSEDYRHEPP 
CPAN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=ProNne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, -=Stop codon,/=possibte 
nucleotide deletion, V=possible 
nucleotide insertion 


8827 


22728 


A 


8891 


32 


142 


WPTRPGQVAS VL YFTT I L 1 1* I PTI S L I 
ENKILKWA 


8828 


22729 


A 


8892 


3 


382 


TPHNPLSRPSYEPKPASTPSRAPKKKKK 
KNPKKKKKKKKKKKKKKKKRGGALLKKT 
LGG P KL PGG KKKNF FF FRGGKKKP PGD F 
IjKKTIjFIjGGGNLiAP P P P KKKN P WGKKK I 
FWGEGGKKNFFFFG 


8829 


22730 


A 


8893 


368 


199 


KKKRI S PPSRKRAPQGDFLRPTPP I KNL 
YYYLSFFFLILYIFFFFFFFFFFFIFFF 


8830 


22731 


A 


8894 


3 


136 


TINLPSTLIIFTILILQTILEIAGALIQ 
AYVFTLLVSLYLHDNT 


8831 


22732 


A 


8895 


1 


144 


I AT INTiPSTL 1 1 FTTL I LLT I LEIAGAL 
I QAYGFTLL VS LYLHD YP YN 


8832 


22733 


A 


8896 


153 


375 


PQKKKKKKKKKKKKKKKKKKKKKWGGGF 
LiKNSGGAPIFGGGGKFFFFFLGGGFKTP 
RGFFGKNPFFWGGKKKKN 


8833 


22734 


A 


8897 


332 


400 


WLTP VI PXLWDAEAGGS FEVRS S 


8834 


22735 


A 


8898 


215 


359 


TEL V PNAVL FCPLPCSS GHHLIjCAI HAK 
RVTIMPKDIQLARRIRGERA 


8835 


22736 


A 


8899 


414 


73 


PRLLCFHKKAWAMAGALPPALLPPYSLI 
SDCCASNQRDSVGVGPSEPGVGYNLWR 
RFLSLSEKRS IRVGVTRFSRCCPSPLSL 
TRKGNS LTPCTSRIiRQCLP LLWLAHGAR 
TH 


8836 


22737 


A 


8900 


374 


88 


VSPPS PLKI FFS PKAFNFWGGVGPHFPP 
PQKRVLPQKPPGGFFSPPLKKKKFSFPP 
PVKLGPPKGSFKGPPPFFFFFFFFFFGIi 
DFMYTGS SNVW 


8837 


22738 


A 


8901 


433 


56 


PEGPPPPGGFYPGKRGFFPTPLWFGGP 
KRTJPPPRGKKNPSWRSGGKQTPRFPRDP' 
LiLKAFFPGGPPQRLGKAQGPFWFFFGPK 
KGKTQTGPPFFFKKLGDPWGPKKKPPPG 
GAPRLGPPKKKFL 


8838 


22739 


A 


8902 


3 


73 


IAVAL IQAYVFTLLVSLYLHDNT 


8839 


22740 


A 


8903 


3 


149 


PNLSYI IGKDTWVEHWPEEDE CQDEENQ 
KQCQDLGAFTESMWFGCPN 


8840 


22741 


A 


8904 


351 


137 


FPPEFFAAPKKSG IL FFS KTWMEVE VLF 
LSQLMLGQKTQYCMFSLINGRIillKAPG 
SLKGDHHPPLGLLEG 


8841 


22742 


A 


8905 


142 


5 


KI FLQ KKKKKKGLGT VAHACNP S TLGGQ 
GGWIMRSRDQDHPGQQW 


8842 


22743 


A 


8906 


367 


244 


HVPQARAKFFC I FSRDGVS P YWPGWSRT 
PDLMICLLFIVEL 


8843 


22744 


A 


8907 


146 


248 


S Q I LGRLRQENCLNLGNGGCS E PRWCHF 
LAWAKE 


8844 


22745 


A 


8908 


2 


197 


AAASTHHARLFLCVCLFAYLLVEMEFHH 
VAQAGTE LLDS SNP P PLASQNAG IIGMS 
HHTQPQYIC 


8845 


22746 


A 


8909 


1 


127 


KKTRLERAQWLTPVI PAFWNAEAGTLLE 
ARSLRRLjPWPPNVLi 


8846 


22747 


A 


8910 


368 


280 


LTMLLRLVLNS WAQAI LLTWP PKVLGLQ 
A 


8847 


22748 


A 


8911 


355 


211 


QQQNLHLFWPGTVAHTRNS STLGGQGMR 
I TGAYE FETTLGNTARPHL Y 


8848 


22749 


A 


8912 


325 


190 


KKKKKKGGGGPQKKKKNPPPPPTSKKKK 
KKKKKKKKKKKKKKKKKKKKK 


8849 


22750 


A 


8913 


1 


411 


VNVPAPRGGAYRGRQASFS SGGLHPVPA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine 7 L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V*=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possibJe 
nucleotide insertion 














YRPLCLPNQTTNSAMAGAPPLASLPPCS 
LISDCCASNKRDSMGVGPSETGTGYNLL 
VCHFLSPLEKCSIRVGVTRFSRCRLSPL 
SLTKKGNSLTPCTSRVRRCLAHLRL 


8850 


22751 


A 


8914 


258 


9 


GAI P CLHCETNPSHI S S TQELQTPEPQL 
PGVPLELP P PGS CFKCRKSGHWAKE CPQ 
PGIPPKPCPICAGPHWKLDCPTGPRL 


8851 


22752 


A 


8915 


326 


42 


EKIHRKYFCYSFFVFLVEMGFYFVGQAS 
LKLLASSDLPAS PSQSARVKAMSHCVRP 
VLVIFKNKILQNTENSTKIYVYPPPNNQ 
ILTFCHFCFK 


8852 


22753 


A 


8917 


3 


106 


GLE L P TS GD L PAS AS QKAG I TG I S HHAR 
PIMTYS 


8853 


22754 


A 


8918 


320 


171 


AASTTDGSYKCLCLPGYVPSDKPNYCTP 
LNTALNLEKC P FGLPHLSGS S 


8854 


22755 


A 


8919 


1 


201 


VKPSDRYHLMP I FTAAYPHQNSTYNVPV 
STRMVMAEEFKQGKRLSLCSDLYRKDLR 
TIVDPWSCAT 


8855 


22756 


A 


8920 


3 


182 


LLRPLEKRG I RVGVSRFS R YHLSRLP FA 
RKGNS PTPCASRVRRCPSLLHGLHPIiSD 
KPQ 


8856 


22757 


A 


8921 


142 


1 


SCHPGWSAWRPQLLRRLRYENCLSLGG 
SGCSELGSCHCTPAWTTEG 


8857 


22758 


A 


8922 


1 


117 


LFFNYAWGWSTTLLSRLSLNSWVHVILT 
PWPPKELGLQA 


8858 


22759 


A 


8923 


357 


121 


LGAFSDGLAHLDNLKGTFATLSELHCDK 
LHVDLENFRLLGNVLVC VLAHHFGKE FT 
PPVQAAYQKWAGVANALAHKYH 


8859 


22760 


A 


8924 


343 


133 


LFFNFNFLETRLREWLIFVFSVETGFPI 
KRAGLKLQAS S DP PARGS QSAE ISGMSH 
YAWP CKSNRAFKCS 


8860 


22761 


A 


8925 


321 


44 


PDSGGS PASAS QVAGTTGACHDAWLMF I 
FNMD ININI P FS AKRDKLSMDKINE 1 1 F 
SLLINRISSFQIMYFLCSSRIESRKRWL 
KQTGRKKE 


8861 


22762 


A 


8926 


335 


106 


HFVFLIETGFHRVGHAGLELLTSTDPPA 
LAYKS SGI TG VS YRS FT KLHL CACV CL C 
IYTYLGLVLSFTPGSWEKKT 


8862 


22763 


A 


8927 


2 


160 


LSRVPPRSANLS I S VFAEKRCHFTMLPR 
PISNSWAQGILLPSQAPKVLGLQA 


8863 


22764 


A 


8928 


3 


140 


RVITEEEKNFKAFASLRMARANARLFGI 
RAKRAKE AAEQD VE KKK 


8864 


22765 


A 


8929 


126 


59 


NSGGGGCNELRS CHCTPAWATE 


8865 


22766 


A 


8930 


3 


104 


KNMAC YCRI PACI AGERR YGTC I YQGRL 
WAFCC 


8866 


22767 


A 


8931 


311 


112 


MENYLSKMQQELEKNITRELKEAAAELE 
SGSIASPLGSTDESNLNQDLVWKASREY 
VQVLKKNYMI 


8867 


22768 


A 


8932 


345 


134 


GGGSSPWFPPFWRPRRADSLRVGVLNPP 
GPGGGAP FFLKNPIi I GWGGGARLWFQL I 
LLRVRPENS FDPGR 


8868 


22769 


A 


8933 


262 


39 


RPRRPHPGNFFFFFNSGSHYVAQAGLEL 
LDSSDPTLPSQSAGTTGMSYQAWPLDHT 
FENRDCVLLHCSSPVSNT 


8869 


22770 j 


A 


8934 


84 


22 


TPAWVWWLTPVI PALWE AKAG 


8870 


22771 


A 


8935 


277 


111 


ERI INHAAGSHGVSGIFMKYDLSSLMVT 
VI E EHMP F WQ F F VRLCGI VGG I FS S TG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8871 


22772 


A 


8936 


2 


106 


DSAI PRRLECLKEDVQRQQEREKELQHR 
YAEFVF 


8872 


22773 


A 


8937 


1 


193 


LMWDFSPSGLDGAFQRGYYWYYNKYINV 
KKGS I SGFTMVLAGYMLFI YCLS YKELK 
HERLCKYH 


8873 


22774 


A 


8938 


240 


69 


DSGNQLKRSFSGRARWLTPVIPALWEAE 
VGESLEVRNIIYILQNMSNIGNIHLIIK 


8874 


22775 


A 


8939 


329 


377 


RWNLAMLPRLVLNSWAQAI CPS WPLKML 
GLEAEPS CPARTNFKVTS PRLH 


8875 


22776 


A 


8940 


358 


120 


H P S N F F VFL VEMG FHHVGQ P GL E L F PAR 
YVP TjLAFRSGG I TG I SHCAWP KE WALPR 
KAS PGLSGRRGS PHCATCSS KAW 


8876 


22777 


A 


8941 


333 


104 


PHVYSNGHICLSILTEDWSPALSVQSVC 
LSI I SMLSSCKE KRRPPDNS FYVRTCNK 
NPKKTKWWYHGKQFEFVITF 


8877 


22778 


A 


8942 


1 


241 


APLVAFTQVNLEDKGGLSKLVEAIRINF 
NDRYDE I CHHWGGNVLGP KSGTRI AKLK 
KAKARELATNWI KCTLLS FLYIKR 


8878 


22779 


A 


8943 


1 


240 


APLVAFTQVNLEDKGGLSKLVEAIRINY 
NDRYDE I CHHWGGNVLGPKS VTRI AKLK 
KAKAKELATNW I KCTLLS FLY I KR 


8879 


22780 


A 


8944 


306 


150 


AAS TLCPRLE CS GAI MAHCS LKTLLGS GD 
PPASASQVGGARGSLEPGRWRLQ 


8880 


22781 


A 


8945 . 


284 


2 


GKSRRSPLDLCTSGFTGSTHFTLIICWV 
PRYVATCPPIGLNFVFIKTWWSHYIVQA 
DLKLMVS SNP P VS AS QS AGNTGMSHHT W 
LENRLLTTKK 


8881 


22782 


A 


8946 


1 


226 


WS FAL FAQ AGLE F LT S GD P PAS AS L SAG 
I TGM S HHAQ P AVS LNS I F S AS T F CAT S S 
YCAKMELKELKFML I VDAA 


8882 


22783 


A 


8947 


363 


96 


NTAPGYMPLLS IPFLVLSKAVFLFLLGM 
FTAALLI IAQRWEQPKCASTDEWINKJyiW 
HMHTMEHYSTCCGVHAAVNISIQISEWN 
CWEW 


8883 


22784 


A 


8948 


1 


208 


GRVGQ P GLRLLT S GD L PAS AS Q S AGVAG 
VSHRAQPTPALNCVLTCWVLQHYIKHNH 
GKRS E YFTI FNDM 


8884 


22785 


A 


8949 


69 


298 


SRGHQRVADAALLS FPDVI CLGLMLWGS 
SSIVCILHRHKRRVQHIHRTSVSPTSSP 
ESGATKTILLEQERTRSPRS 


8885 


22786 


A 


8951 


2 


201 


L YDLLN MR SISLPLPS L VC I H S HTHEHT 
YTH I HTHAHT YTHTHS HW VLS L KQ KKMV 
KHRPRLSVWI 


8886 


22787 


A 


8952 


1 


285 


GASPPASLPPCSLISDCCASNQRDSVGV 
GPSEPGAGYNLWRRVLSPSEKRSIRVG 
VTRFSRCRPSPLSLTRKGNSLTPCASRV 
RQCLTLLWLVH 


8887 


22788 


A 


8953 


341 


256 


LGLGGGGC S E PRWHHRT P AW VT ERDF I S 
L 


8888 


22789 


A 


8954 


227 


43 


I LFF IGKHTHTHTHTHTHTHTRAKCLFP 
PARFPTSPNSTTNCTFPTEPAANPRGLV 
RAGLG 


8889 


22790 


A 


8955 


310 


79 


RPRRPGLMERVNVF IFSILRES FYKIR I 
TNQAQCCMPIVPVTQETEAGRSLEPKSL 
RLAWAIQQNS ISKKN1TNNNFF 


8890 


22791 


A 


8956 


374 


198 


FFFFLPNRHGLAMLPRMVLHWVQAVLPQ 
PPKVLGLQG 


8891 


22792 


A 


8957 


138 


10 


RPRRPRLCQAQWLMPIIPALWESVAGGL 
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SEQ IO 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L.=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
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LEARSLRPAWFFFNT 


8892 


22793 


A 


8958 


354 


216 


AGPRVFPGGEGGFRPPPPGGSPPQGPPK 
GGESGGKPPPPAHWVFF 


8893 


22794 


A 


8959 


364 


223 


LQAARSKKKKKKKKKKKKKKKKKRGGGK 


8894 


22795 


A 


8960 


259 


11 


FEALKDPSNNLQSLFSFIVTVLTVAHVY 
EKLSTLTSLAARRGLAMLPRLASNSRTQ 
TILLPWPSKVLRLQARATEPGLGLLL 


8895 


22796 


A 


8961 


334 


213 


AASTKNFERMEWGIiTTLPMLVLNSWPQV 
ILLPWPPESMRL 


8896 


22797 


A 


8962 


260 


3 


KGGFFPIPFIGIKPGFFFSPALIFPKVF 
I FSQNFF VFLKKI FFFFFVETVS SHWK 
AGLELLASSNPPAFTSQSAGMTGMSHHA 
W 


8897 


22798 


A 


8963 


425 


248 


ADE W I HKMRH I HM I E Y YLAI KRNEML I H 
ATAGMHLENILSEKDHMYDS I YMKCLAW 
ANL 


8898 


22799 


A 


8964 


1 


118 


GKLME LHGE GS £ S GKATGDE TGAK VERA 
DGYEPPVQESV 


8899 


22800 


A 


8965 


2 


130 


GRVGPRMALNSWAQVIHLPWPPKVLGLQ 
VAMKRKIMILSTSY 


8900 


22801 


A 


8966 


119 


364 


LGWYLIEKIQICCSPCLGADPFQQMISP 
LRDGSLTVLPKLVLNSWPQVFLPPQTRK 
VLGLQAE VTAPGH I I FMI IMINTFK 


8901 


22802 


A 


8967 


238 


333 


TRCLRLLPRLVLNSWTQABLLPSPPKVL 
GDQV 


8902 


22803 


A 


8968 


297 


169 


NPPPPHIYKFFFIFNIFFFFFFFFFFFF 
FFFFFFFFMSKILIF 


8903 


22804 


A 


8969 


2 


212 


RVRPRDRPLVRVRVGRNKDGAETTPSPG 
LLPAHLTFPLDYHLNQPFIFGLRDTDTG 
ALLFIGKNMDPRGP 


8904 


22805 


A 


8970 


242 


357 


TLDI I KS F KKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKI I 


8905 


22806 


A 


8971 


191 


265 


ILMPXKKKKKKKKKKKKKKKKKKKR 


8906 


22807 


A 


8972 


398 


97 


MFLYNRILWFFSPHPPLRFFFSPRATIF 
LGGGAPHFPPPKKGFFSKKPPGVFLSPP 
QKEKIFSFPPPLFLAPPKDFFKSPPPFF 
FFFFFFFFVSTGSSVS 


8907 


22808 


A 


8973 


370 


265 


YNSSLPSSWDYRHVPPCPDNFCIFSRGW 
SQTPDLR 


8908 


22809 


A 


8974 


3 


119 


GLKLLTS S D P PAS AS KTAG I TGVSHHTQ 
TIFVFSSHKS 


8909 


22810 


A 


8975 


1 


110 


VAQ AG F KL>VS S ANL P TLAS Q S ARVTGVS 
HRARSMKQ 


8910 


22811 ! 


A 


8976 


3 


152 


WNQPKYPSTGEWMEKMWYIYTKLVQPMA 
HGPHVAQDGFEGSPTQIHKLS 


8911 


22812 


A 


8977 


375 


193 


FQDLPLGSTSNISFFKRWSPAMVPRLIS 
NSWPQE ILLPQTPKVLGITGMSHCTWPG 
GHISI 


8912 


22813 


A 


8978 


2 


395 


RAALPTQAS AMAGAP P PASLP P CSLS SD 
CCASNERESVGVGPSEPGAGYNLLVCYL 
LRLLEKRSIRVGVTRFSRGHLSPLSLTR 
KGNSLTPCASRVRQCIjALLWLTLSALiYP 
LSCTHCQTLPSEMNLVPQb 


8913 


22814 


A 


8979 


120 


19 


ARVQWHDHGSTATLNSWPQVILLPWPPK 
VLGLQA 


8914 


22815 


A 


8980 


204 


48 


LNRDKGLTMLARFVRNSWPQAI PLPWPP 
KVLGLQALATAPCQMS FLRAPLS 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GJutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8915 


22816 


A 


8981 


335 


221 


FFFLFLFFCMITSHCSWVLGASDSPTS 
VSHIAGTIGT 


8916 


22817 


A 


8982 


360 


150 


LSHMPIjIjFVSLYPDHRAFMQVI FFFFET 
ESPSVTRLECSGAILAHCNVCtiPGSSDS 
PASASYAGYSFYL 


8917 


22818 


A 


8983 


346 


116 


GQGRGVETESCSVTQAGVQWCDHHSLLS 
LELLGSSDPSTSASPVTGTTGGHYHTSS 
HFLFSVSSSYFPTPVCLAYR 


8918 


22819 


A 


8984 


350 


263 


LPMLPRtiVSNAWPQAIFLLWPPKVLELQ 
A 


8919 


22820 


A 


8985 


3 


159 


E E YNLLVCRLLS P LEKCS IGVRVTQFSR 
CHVSPLSLTRKGNSLTPCASWVR 


8920 


22821 


A 


8986 


2 


253 


KHWGELKNDLS I CGTLHS VDR YLNI KL 
TNISVIDSEKYPHMLSVKNCFIQGSWR 
YMQLPADEVHKQLLQDAARKEPLQQKQ 


8921 


22822 


A 


8987 


2 


115 


GAR I VGHLTHAL KQGE YGLAS I CNGGGG 
ASAMLIQKL 


8922 


22823 


A 


8988 


3 


159 


GFYHIGQAGLKLLMSGDLPASASQGAGI 
TGLSHCAQAGI FFFLP YD I FT VF 


8923 


22824 


A 


8989 


1 


123 


S H YVAQAGP KLLGS SNPPAS AS RSAG I T 
GVSHHAWPPSLFL 


8924 


22825 


A 


8990 


462 


303 


LARTPSRPTRPPTRPPTRPPTRSPTRPP 
FMLSVDCLLLY 


8925 


22826 


A 


8991 


382 


178 


PPFFFYFFFFSFFFFFFFFFQKGFKGFF 
PQKFFFFFFGGKPVKNFQKFFLRAKGKG 
GEKKPPQNKKVF 


8926 


22827 


A 


8992 


532 


432 


RDAGLTMLPRLVSNS WAQVLLLPGPPKV 
LTLQV 


8927 


22828 


A 


8993 


418 


172 


KKGKKKKKNI FL I F FNYF FFLIFFFIFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFGGAWSLFRGGDSGEVKGTDSMS 
RQCFQGF 


8928 


22829 


A 


8994 


2 


722 


AVRLNI S YPPQNLTMTVFQGDGTASTTL 
RNGSALS VLEGQS LHLVCAVDSNP PARL 
SWTWGSLTLSPSQSSNLGVLELPRVHVK 
DEGEFTCRAQNPLGSQHISLSLSLQNEY 
TGKMRP I S GVMLGAFGGAGATALVFLS F 
C 1 1 FVWRS CRKKSARPAVGVGDTGMED 
ANAVRG S AS QGPL I E S P ADDS P PHHAP P 
AliATPSPEEGEIQYASLSFHKARPQYPQ 
EQEAIGYEYSEINIPK 


8929 


22830 


A 


8995 


498 


323 


EPKAELNEGDIAVVHIKREEGWFKGTLQ 
RNAKTGLLPGSFVENI 


8930 


22831 


A 


8996 


21 


511 


L I IDGVEAYALNASGWNI I VFDPKGWA 
LFRSFKAVKEKLDTRRGSNSELETAVKD 
LGKAVSYKGMYGDVAIWYSGQYVENGV 
KKNFLPDWTMVLGNTQARGLRTYGCIQD 
ADAQRE G I NAS AR YP KNW VTTGDPARE F 
TMIQSAPLMLLADPDEPVSVQLA 


8931 


22832 


A 


8997 


178 


306 


GAINAINWPPFNNVGMGKTLGFPNbMAL 
KQ I PFS P VGE KTKRR 


8932 


22833 


A 


8998 


351 


211 


S I Q S LRMQ P VP C Y I S GRHHH YH SHHH IH 
HHHYSSYFL.KYPQEYLRPL 


8933 


22834 


A 


8999 


1 


381 


FRLFKFALMELCSIEGISDQPLFFVFGF 
FFFGKEAS FGPQGGREGGKFKLMEPLAK 
GVKGISCPKPPHPCLNPPKPKRGGESSP 
P E G PKKTGAP GPAKKGL F FLT KGKN P W P 
GPGKTRKKAVFPKNF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E— Glutamic Acid, F~ Phenylalanine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K=Lysine, L=Leucine, M=Metfiionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8934 


22835 


A 


9000 


173 


399 


I KS E WAAKKKKKKKKKKKKKKKKKKKKK 
KKIFFFGGGRFFFFWGGDYFWGGGGVKK 
KGGGKKNPGPQRKNGGGKK 


8935 


22836 


A 


9001 


2 


81 


EFT P AVHAS LD KF LAS VS T VLTS KYR 


8936 


22837 


A 


9002 


3 


835 


DAVRANDDLKEWIAIVERRNNLLQAELS 
ELRAWEQTERSRKIiAEQELI ETSERVQ 
LLHS QNTSL INQKKKMDADLS QLQTEVE 
EAVQECRNAEEKAKKAITDAAMMAEELK 
KEQDTSAHLERMKKNMEQT I KDLQHRLD 
EAEQIALKGGKKQLQKLEARVRELENEL 
EAEQKRNAKSVKGMRKSERRI KELTYQT 
EEDRKNLLRLQDLiVDKLQLKVKAYKRQA 
EEAEEQANTNLS KFRKVQHELDEAEERA 
D I AE S QVNKLRAKSRD I GTKGLNEE 


8937 


22838 


A 


9003 


429 


280 


RGFRGHKFLE P PGFPGFP P I MGPPS PPF 
GSPPQKNFPLQPKPPLGFKPPP 


8938 


22839 


A 


9004 


2 


108 


LSAYQGTPLPAWIDDWQALNYEIRGYVI 
IKPLVWV 


8939 


22840 


A 


9005 


3 


158 


SSTQGHKHCGRPQGPLPRKTRDLCSPVY 
LLTFSPLLSYDPVTSPSPRNTQE 


8940 


22841 


A 


9006 


3 


370 


LLLTGLNRLTTDLYSLYI FTTTQWGSLT 
HHINNIRPS FTRENTLMFIHLCPILLLS 
LSPDI ITGIFSYTRHYVNSSTYLETNER 
GGS YKNRLGGPESKGLRI KDDSLCWGPL 
HDTSAYYENK 


8941 


22842 


A 


9007 


363 


76 


KLFNVGGGGLDKPPPPKKFFFFKKNGLF 
FFYPLKKKKNFFFPRELLWGPPRFFIKT 
PPPYFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFVMIS 


8942 


22843 


A 


9008 


1 


87 


KMLKKPKFELGKLMELHGEGSS SGKATG 
D 


8943 


22844 


A 


9009 


420 


3 


RFYFPKPRKRGGGGVFFFFPPKKKFFFL 
NPPPFFFFPPPKKKKNIFFPPRKGGRGS 
KVQTMGPPLFFFFFFFFFFFFFLFFFFF 
FFFFFFFFFFFFFFFFFFFLRETSSNNV 
LAYQ I ENS XjL KLDLQRQ F C L P L P GRV 


8944 


22845 


A 


9010 


2 


74 


LHRLGGPEAGLGEYLFERLTLKHD 


8945 


22846 


A 


9011 


33 


132 


IDPRCRNSARAPLSLAGPHPGMGDASNH 
MGQM 


8946 


22847 


A 


9012 


83 


607 


RRS GT P CG YNVT S QNGT IYSPGFPDEYP 
I LKDC I WLI TVP PGHGVYINFTLLQTEA 
VNDYIAVWDGPDQNSPQLGVFSGNTALE 
TAYS STNQVLLKFHSDFSNGGFFVLNFH 
AFQLKKCQPPPAVPQAEMLTEDDDFE IA 
ITGEGKPTRMPQSRKKSCHI ISVLISAK 
LNRAFL 


oy47 


22848 


A 


9013 


483 


364 


PPPPPPFL>NPAPGIFFPPPLGGIGARPP 
PPSFFFFFFFFF 


8948 


22849 


A 


9014 


74 


180 


LLSTYVGRLSARPKIjKAFIiAS PEYVNLP 
INGNGKQ 






2\ 


oni < 
yuLD 


400 


31 1 


£LV{j\^f\\j±i VIjLj J. oVjUir ir Xibfloyo io-JL J.Cjv 

SHRAQPLRSFSFVLSLLQKRIVLC 


8950 


22851 


A 


9016 


149 


13 


DERGGLQAVAHTCNPSTLGGRGGWITRL 
GDGDHPSCRILAGDASQ 


8951 


22852 


A 


9017 


20 


107 


FDSTADSDQVNPIQGLASKWDYEKNEWK 
K 


8952 


22853 


A 


9018 


448 


239 


VDHQQVI WNRERI SNSQNGI VKE I KGAD 
TF I FGHT PAVKPL KFANQM Y I D TGAVFC 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L-Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GNLTLIQVQGEGA 


8953 


22854 


A 


9019 


3 


209 


CPPLSPGCTNSARAEPSVHALPDALNNL 
RTHEGS GDGP S S S VDWNR P E D V YPQG I Y 
VISAPSIYAREA 


8954 


22855 


A 


9020 


504 


20 


N YAQMS YS TARAS ANE S WAYFMGRRKF V 
ASRQASQMFLCWLEEAIVRRWTLPSKA 
RFSFQEARSAWGNCDWIGSGRMAIDGLK 
E VQE AVML I EAGL S T YEKE CAKRGDD YQ 
E I FAQQ VR ETMERRAAGLKP PAWAAAAF 
ESGLRQSTEEEKSDSRPAAGS 


8955 


22856 


A 


9021 


3 


168 


LPRLEAAARIRHEERERERERERERERE 
RERERERERERERERERAPPPTLCET 


8956 


22857 


A 


9022 


2 


89 


LCSLVYLLTFPPLLSCDPAKSPSARNTQ 
E 


8957 


22858 


A 


9023 


508 


371 


FFCRDKVLATLPRLLANAWPQVIFLPQP 
PKVLRLQPCTTPSLKDL 


8958 


22859 


A 


9024 


2 


170 


RTHSAGRP YQCNQCEKAFRHS S SLTVHK 
RTHVGRETIRNGSLPLSMSHPYCGPLAN 


8959 


22S60 


A 


9025 


3 


185 


PPAATPQGPSPINSPPTKKKAKNKNRTK 
KTKKKKKP P PS KKKGP QKT P KKTMGG P L 
KTTT 


8960 


22861 


A 


9026 


104 


319 


LVFLSKYTPPLLVSLWIHGLSLLSFLPS 
LPSFLPSLLTYLLPSFLPSFLTSVINPL 
PSLFFLVFKLSQFWT 


8961 


22862 


A 


9027 


476 


364 


SLKRLTSGDLPASASQSAVITGVSHRAR 
PIIYFKYVQ 


8962 


22863 


A 


9028 


36 


154 


F L VGQAGL KLLT S GD L PAS G S Q S AR I TA 
VSHRTWRTHYF 


8963 


22864 


A 


9029 


2043 


1237 


WLRTRVAPALPERLGACTQLGPVLPCSQ 
PYWCRQCPEYRRQAAQPPHCPAPEGEP 
GAPQALGDAPSTSVSLTTAVQDYVCPLQ 
GSHALCTCCFQPMPDRRAEREQDPRVAP 
QQCAVCLQPFCHLYWGCTRTGCYGCLAP 
FCELNLGDKCLDGVLNNNSYESDILKNY 
LiATRGLTWKNiyiLTESLMALQRGVFLLSD 
YR VTGDT VL CYC CGLR S FRELTYQYRQN 
I P AS EL P VAVTS RP D C Y WGRNCRTQVKA 
HHAMKFNHIGEQTRFKN" 


8964 


22865 


A 


9030 


15 


389 


I VDH YP CGGIPTSG S VTQAG AQ WHNLG P 
LQPRP LGL EGS S CL S L I CS WGHRHAP PH 
LARGHFSSEVEQLRSRLQHGAVFVKQRK 
GTPEDPRQGLRTL I S S CLLHL VALAL WF 
SLPFHGPRIHMQ 


8965 


22866 


A 


9031 


3 


243 


RHSLAFNRFSCLSLPSNWDCRHPPSCPA 
KFCTFVEMEFHHVGQAGLELLTSGDLPT 
LAS QS AG I TGVS HHAWTRC CC C F 


8966 


22867 


A 


9032 


1 


245 


PTRPAG I AI C I CMCMKNHRATRVG I LRT 
THI NT VS S Y PGP P P YGHDHEME Y CADL P 
PPYSPTPQGPAQRSPPPPYPGNARK 


8967 


22868 


A 


9033 


46 


124 


QELMTHGAKSPDGTRVHNAFLFVTTV 


8968 


22869 


A 


9034 


235 


435 


KKKKKKKKKKKKKKKRRPFKLKGPFKPP 
LKPQGGLFEAPPFGGPPPPGFFFAGGGA 
PPRGFFKKKGK 


8969 


22870 


A 


9035 


333 


70 


CPIYQIPILKDQVNLSSISSPFSHLIKD 
LWKTL F I S Y I TE L I F VC F VEMM FHHVAQ 
AGLELLGS SNPPTS TFQSARI TGI SHHA 
CLNI 


8970 


22871 


A 


9036 


297 


411 


NKTMS FAATWMQLEVILSKLMQEQKTKY 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Un known, "—Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CMSSLISGS 


8971 


22872 


A 


9037 


461 


167 


NKNYPLLGGPPFGSPSPENLNGGFPLTP 
GGGGP INPNS P P PP P PGGRRP P FLKKKK 
KKKKEKETNTYILQDTCDRCSAVLiPIMQ 
VKIVMI CDIMHRLF 


8972 


22873 


A 


9038 


466 


292 


NGCLLEPASWNSLANIFIFCREEGLTML 
PRPQVILPPLPPKVIiRLLPPPLSS 


8973 


22874 


A 


9039 


34 


124 


GIRNIKFSSLSTFIAHKKYYCSSHAAEH 
VK 


8974 


22875 


A 


9040 


54 


239 


RRRRKLRRWGLAMLPRLVSNSWHQAILW 
PPKVLRLQACATTPGLTRRWLHGQIWSP 
GAGSV 


8975 


22876 


A 


9041 


469 


223 


PWARVCVCVCVS VCVCACVRVCARASE S 
QTCAAVPSLGPAGAGRPALGCCAVCEDE 
VGSCPHARPLPRAAVDKAAGRRGLCF 


8976 


22877 


A 


9042 


491 


380 


SPGWAHSYNPSTLDGRGSHITRVRKFM 
TCLATEGAV 


8977 


22878 


A 


9043 


144 


360 


IHSHWGYWLDVSLIAAHLAINPDCFMYV 
YIFSQCTDKLTEKEFIEGTIiANTKEILRL 
IQFEPQKVKEKMKNA 


8978 


22879 


A 


9044 


23 


263 


RWNSDGKGSI>RPAWATWPDPVS IKNTQN 
THTTHPTPHSPHTPHGQQPFLSPTPFPW 
PVSSPSPFPPFSHTCLSCLLPPGA 


8979 


22880 


A 


9045 


461 


287 


PPSIiANIYLFIYLFIGRDGVSLFPRLVS 
NCWAQAILLPWLPKVLGLQGEPPNLALK 
SL 


8980 


22881 


A 


9046 


485 


6 


NNKNS WAW WRVP VL P ATREAGAGE S LE S 
GRRRLR 


8981 


22882 


A 


9047 


475 


307 


LFLRDGSLTVLPRLVSNSWPQAILLPWL 
LKGLGLQARATLHMACSSCINSTQNILQ 


8982 


22883 


A 


9048 


483 


338 


LREGERQREQERERERERERERDRERAH 
TTAFTQLLFQYVAIILIiPCY 


8983 


22884 


A 


9049 


680 


303 


NDLLFGSLYVKQFAASGTGRVTLPEPSE 
EQWCLRCFQDLFGDDHNKNGFKMFDSS 
N YHG QDLL F KDAT VRAVP VGE KTT YRGW 
LGLDYVAALEGMSSQQCSGAGRTGPPCR 
LPWCYSDPFTPEP 


8984 


22885 


A 


9050 


26 


165 


AFG YHRVGQAGLE LLTS GD P FAS AS QS A 
GITGVSHHARSQYFFVYV 


8985 


22886 


A 


9051 


461 


83 


HKINLPSHKKKKKKKKKKKKKKKKKKKK 
K 


8986 


22887 


A 


9052 


475 


209 


MAAPPKKKKKKKKKKKKKKKKKKKKI 


8987 


22888 


A 


9053 


531 


404 


S VTQAAMQWQKHNS LQGSTLE FKQSSLL 
SLPTIWDYKHADLS 


8988 


22889 


A 


9054 


66 


416 


KKKTNKIKMGGGPLPFKISPPFLKKTLS 
WKKGAPKTPPLEMAPFFFPQGGKKNS PF 
KGGKIFFFFFPKKTFIKGPPGGEHTPPL 
GGGPQIFSLPPPKGVPQKNPPPLLRGGL 
FFTI 


8989 


22890 


A 


9055 


1 


104 


ILAVNGETLEGVTHEQAVAILKHQRGTV 
TLTVLS 


8990 


22891 


A 


9056 


3 


106 


LKRWGIAMLTGLVLNSWAQVILPSWPPE 
VLGLQV 


8991 


22892 


A 


9057 


1 


299 


PTRPINI ILAFTISLLGILVYRSHLISS 
LLCLEGIILSLFIIATRIYPIAPPLPPL 
EPEKKKKKKKKKKKKKKKKKKHKKNTKK 
RKI KAKRKNLGGGGL 
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SEQ ir> 
NO: of 
nucleotide 
sequence 
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NO: of 
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od 
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NO: in 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W— Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


8992 


22893 


A 


9058 


482 


146 


LTSS KKKKKKKKKKKKKKKKKKKKKK 


8993 


22894 


A 


9059 


2 


290 


YFLFAYTILRSVPNKLGGVLALLLS ILK 
KKKRGGPFKRTLGGPKFNRGGQGNIFFL 
MGGGLKPNLGILGRNLSLGGGKNWPNLP 
PKIKGFRENKNF 


8994 


22895 


A 


9060 


413 


48 


PQKKGFFQKNPKGFLKPPPKKKKKKFPP 
PGKIWPPQKIFKKPPPPFFFFFFFFFFF 
PKFFFFFFFFFFFFFFS IRNPPSS YTKW 
LRSLDVCHLLPPITPQVPGPTPVLLFST 
LPDCNHSLL 


8995 


22896 


A 


9061 


2 


274 


FTISLLGILVYRSHLISSLLCLEGITLS 
LFIIATLITLNTHSLLANIVPIAILVFA 
ACEAAVGIALLVKKKKKKGRPFKRILGG 
PQINPG 


8996 


22897 


A 


9062 


42 


255 


QEFRTRRERERERERERERERESSGWR 
AKFRSNFSAGAIAHTIRVMLPPSGIYSN 
EKSINKCGFALLFF 


8997 


22898 


A 


9063 


20 


413 


YALFJNTS ARAGD PHGQ LGRRD GAVGGRG W 
LMPREALHPWWANVQAWAGS PGLFLGAP 
WPQGGVWLPWLCVSMSASSSS INGLPPQ 
KKKKKKKKKKKKKKKKFWGGGPGPQS P F 
WGGLKKFGGRFLKRGGGEN 


8998 


22899 


A 


9064 


6 


306 


ECEHSTTYSHFAGGGGPVMVGGDADARS 
KALLGVCVGSGTEAYVLVLDPHYWGT P K 
S PS E LQAAGW VGWQE VS A&FD PMS F YNL 
CLTSLSSQQQQRTLD 


8999 


22900 


A 


9065 


463 


126 


AHYQLVYTGKVCGTRSSKRI S KLAYHQG 
WIVTCPGCQNHHI IADNLGWFSDLNGK 
RNIEE ILTARGEQVHRVAGEGALELVLE 
AAGAPTSTAAPEAGEDEGPPSPGKTEPS 


9000 


22901 


A 


9066 


23 


244 


S CHAGIRHEERERERERERERERERERE 
RERERE X YFIiAL FS QKGRVHGFFKTPRG 
GQKRELLSNRVNTLCTE 


9001 


22902 


A 


9068 


396 


143 


KDIQNFGQAWWLTPVIPALWEAEAGGSR 
GQEIETILVNMVKPHLSVLLAKNSSSHC 
PMCWLAELQWYDL CS FQKI FFG 1 1 L I PG 


9002 


22903 


A 


9069 


828 


99 


VDREPVDREPWCHPDLEERLQAWPAEL 
PDE F F E LT VDD VRRRLAQ L KS E RKRLE E 
APLVTKAFREAQIKEKLERYPKVALRVL 
FPDRYVLQGFFRPSETVGDLRDFVRSHL 
GNPELSFYLFITPPKTVLDDHTQTLFQG 
EPLPGRSGCNLGAEEPAGVYLEPGLLEH 
AISPSAADVLVARYMSRAAGSPSPLPAP 
DPAPKSEPAAEEGALVPPEPIPGTAQPV 
KRSLGKVPKWLKLPASKR 


9003 


22904 


A 


9070 


20 


427 


DGTRRERERERERERERERERLYLTEDT 
LHS L S LE GARSGAS VC VQS V YP TGASAP 
HS I INGHKRVLVLCVETEHTFPLAEKDR 
AFLGDTSVFVGAAPVGGANTGASAGGAP 
PSRGEPKEESGESHEDMGFGLFV 


9004 


22905 


A 


9071 


297 


126 


CS I YDKRI PTFGLGEGVSERQRLTLLLP 
CLPPALTETPTSSDWQSTDATPTLTNSS 


9005 


22906 


A 


9072 


117 


286 


WTjRGSEKMFQVQEINEQYLQCILIKGI 


9006 


22907 


A 


9073 


3 


302 


PPMETLRQWTTIVHSQEEEEEHFEVIKD 
EVKWARKHGQPGTPVAIATQLPPRTSA 
AFPAQQQPLQPLAPTTVLSVSADWNALL 
QRPNKYIWGLQYLEM 
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ing to first 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L— Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X~Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \— possible 
nucleotide insertion 


9007 


22908 


A 


9074 


437 


79 


NQKREEEEERRKKRRRRRRKRRRRKKKK 
KILRQKKKKRRKAGSREKKKILRHLPEK 
NKRSKKHYELINALSRLQDTRLIYQKMH 
FYILAKNNWKI KLKIQYHLQALENKVPR 
HKWETYA 


9008 


22909 


A 


9075 


51 


185 


CRADFMFFHDKI IVYLCSLVYLXjTFPPL 
LiS YDPAKS PSARNTQE 


9009 


22910 


A 


9076 


452 


310 


QFSRCHLSSLSLDRKGNSLTPCTSWVRR 
CLALLRLTIiGALHLLSDNPQ 


9010 


22911 


A 


9077 


438 


326 


LLHCWWDCKLVQPLWKSVWRFLRDLELE 
IPFDPTLHF 


9011 


22912 


A 


9078 


37 


169 


GIPGFRHAGQRRLQLLITGDLPTSASHS 
AG I TGTS DRARP P S TF 


9012 


22913 


A 


9079 


1 


408 


RKGRL S KE E I E RMVQ E AE KYKAE DE VQR 
ERVS AKNALE S YAFNMKSAVEDEGLKGK 
ISEADKKKVLDKCQEVISWLDANTLAEK 
DEFEHKRKELEQVCNPIISGLYQGAGGP 
GPGGFGAQGPKGGSGSGPTIEEVD 


9013 


22914 


A 


9080 


31 


168 


VRRNPRLEHTVTITAYCSLELLGSSNLP 
AS AS Q VARTTGLQ L LNLQ 


9014 


22915 


A 


9081 


460 


319 


YFFVEMGSHYIAWAVLELLGSSNPPASA 
S QS AE FTDAHKPTD GRPLR 


9015 


22916 


A 


9082 


465 


326 


P CVAE FCTHTHTHTHTHTHT FKCVGSHL 
HTNWD 


9016 


22917 


A 


9083 


33 


269 


PGNSRPXSSVRGIHPPCLANFCFLNVFV 
EMGSHYWLAGL ILGSSDLSALAVQSGR 
VTGMSHCAWPE I FFLLENWI IL 


9017 


22918 


A 


9084 


47 


300 


DLITSLFLLKQSSLiATMNIQTPFKNIYF 
DIjIIKTLFLRFFIFCFIYFYFLFFSYGG 
FFSFFFFFIiKGSSFLPPRLNGGAKIWW 


9018 


22919 


A 


9085 


437 


220 


MIAIKSLSLSLSLSLCVCMCVCVSACVC 
MCLSLSLFSPSSLSHSLFPFSTPNMDIS 
HLPILSTQENMLLLIiQ 


9019 


22920 


A 


9086 


227 


23 


DCPKANILGLTPRLVSNSWPQVILTPWP 
PKGQSLLCRQGEGDRIWTWHPWWLLHPP 
PQARFPPRTHFH 


9020 


22921 


A 


9087 


206 


326 


PHWAQWPTPVIPALWEAEAGKSQGQEFE 
TSLANTVKPRLY 


9021 


22922 


A 


9088 


364 


123 


SRWLPLKAPGEDTSLPLPASGGSRCSL 
AWGSIPPISASIPTWPSVLCPCEIFPLL 
FLLYGDCNLGPTLI PDDLIMRSLP 


9022 


22923 


A 


9089 


373 


205 


MPPRLAKTFCYFSRDOTSPYWSGWSQTPC 
WDYRREPPFPPCVRTSYSVQYCACYAGL 


9023 


22924 


A 


9090 


283 


2 


REFS FSAWLAFKI FWKLFS PPFWRVRP 
EDWFIPGGCGPNEPNFPPCPPGVGKKGE 
PRFKKKKKKNF I S SQAQWLMP VI PAFWE 
AKAGGSIiEP 


9024 


22925 


A 


9091 


16 


261 


CPEDLLVGQNVKVEDSDMGDPDRRPMCM 
FACF YLLVYRRCLAMLPRLYLNS WAQV I 
LLPQPPASLELLQAHATTHPALCLSF 


9025 


22926 


A 


9092 


17 


109 


T1 f~\r~1T 7\ IVfT DDT T 0"NTO Tj,T~T>f~\ A tvTT T T)Ta7TV'0'C^"CT 

KU CjJjAMijlrKJji X faJN W I yAN J_il>Jr' WPFKJbiJj 
GL 


9026 


22927 


A 


9093 


282 


114 


KKGGKGALLVFLKTGRKP FFFFLRQGLA 
MLLRLVLNSWAQAILIaPWPPKVLGLQA 


9027 


22928 


A 


9094 


205 


337 


NANLHGRMQRLT P CL PS I WE AE TGGSQ G 
QE I ET I LANMVKP CLY 


9028 


22929 


A 


9095 


62 


287 


EHSCAKLFCFAFESGPYSVAQAGVQWHD 
HSLLQS QTPGLKQS S CLS FPASAGF YKE 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K— Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, v — Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














S P S F YKDKS CHHLGKH S GA 


9029 


22930 


A 


9096 


170 


9 


AASTGYCIGRFCTKLKIQSGGQVRWLSP 
WP KREAE VGGLLEARS SRP I WATW 


9030 


22931 


A 


9097 


2 


104 


RRPGGRAEPRARVRGDRDPPVLLRHYPH 
IFEECG 


9031 


22932 


A 


9098 


350 


142 


LCIiS CPKGGGYRAMAPWRGNFCSFGKNG 
GFPMLGRWLKWPQMGPLPQPPKRVGFQ 
GLSPPPGPPPFLV 


9032 


22933 


A 


9099 


494 


342 


YFYFYFVEMVSHHVTQAGLELLASSNPP 
AL I S QHAVI T S G SHHTQ P KL Y F 


9033 


22934 


A 


9100 


30 


232 


GGIMLCHGRCACFPFAFCHDCKFPKVFQ 
PCFL YS LQNCS FLYKI P SLRS CRLQLAE 
LLSIIYLENPL 


9034 


22935 


A 


9101 


26 


171 


RNSRRLRHKNRLSLGGGGCSEPRSCQCT 
PAWQTARLSLKNKKQQQKKP 


9035 


22936 


A 


9102 


486 


239 


HHTCPHQVCVCVCVCVYRVCVCVCVCVY 
IYMYMCIYIYIYI YICIYICRERERERE 
REMWFHHVAQAGLELLGSRDLPAPAS 


9036 


22937 


A 


9103 


487 


344 


TGFHHVGQAGIELLTSSDLPASASQSAR 
ITGVSHHAQPLFMI FCAKC 


9037 


22938 


A 


9104 


470 


221 


TPGLKRYTCLSLPKCWDYRCEPHARQGA 
FLCAFSPLSSQQPWGKRPHFHFSYETNK 
VERGKRLPLRSPSTEQGFDFRCVLFQT 


9038 


22939 


A 


9105 


468 


259 


YVFF CRDGGLT I S S RLVLNS CP QQS P CL 
SLPKGWDYRHEPPCPSCCSNLLLMFHFT 
NVFLIDSECSHFTP 


9039 


22940 


A 


9106 


466 


329 


RVAGTTGTHHHAWL INI P S VETGFHHVG 
PQVIQPPRPPKALGIiQA 


9040 


22941 


A 


9107 


409 


306 


AGFEIiLASSDLPASASQSAEITGVSHHA 
RPSLIF 


9041 


22942 


A 


9108 


417 


269 


M WAQADMES S KE PGLFNWI INDSLVQ 
AYPEL KEAL YEE I KKA.QRRGA 


9042 


22943 


A 


9109 


533 


399 


RGS LRHE TRLNAE ARRC S E LN S RQC T PA 
WATERDAVS KKTKQ KA. 


9043 


22944 


A 


9110 


16 


239 


L VH F E VRWNSHE VS LTAD S RKS HLI F P Q 
PNYADTLINQESYEKSEPLLITQDLLET 
KGEPRQLQVSFFPPKREE 


9044 


22945 


A 


9111 


68 


219 


EPPHLANFLQRLVFTMLLTLILNSWPGV 
ILPPGPPTVLGLQDKLPVLLPS 


9045 


22946 


A 


9112 


192 


57 


DRVPLCPPDWSAVARSWLTAASVSQVKG 
ILSPQPTECAPPGNST 


9046 


22947 


A 


9113 


21 


201 


TIVSWEFMNLGGGTCGEPRWCHCTPDW 
GTE RD S VS GEKKE D KD VARRHW VGS F YN 
GPQ 


9047 


22948 


A 


9114 


2 


144 


T FT I DRC AKDL F VAKQ VG T KL S VNE PBS 
FSVES ILKRPSSAITRVSQ 


9048 


22949 


A 


9115 


156 


316 


MILNKKKKKKKKKKKKKKKKKKKKKKKK 
KGGGPFKKTIiGGAKITPGWKNYFFFFMG 
GV 




22950 


7\ 
jrt. 


yiio 


1 


84 


T-i-r/— pjrri -r-> TPT nfT T r PT TTT T T TlTCntPTT T T T 


9050 


22951 


A 


9117 


1 


176 


HLNLGGGVCSEPRSCHCTPVWATEILSQ 
KKKRKNGFKFWEPRPQKEGGGGRKKPPG 
VP 


9051 


22952 


A 


9118 


1 


184 


AAAGYPGQQQNFHSVREMFESQRIGLNN 
SPVNGNSSCQMAFPSSQSLYRTSGAFVY 
DGSKF 


9052 


22953 


A 


9119 


69 


374 


I IFSKKKKKKKKKKKKKKKKGGPPKKKP 
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Predict- 
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nucle- 
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Amino acid sequence (A=AIanine 
C=Cysteine, D— Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GJycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, jYI=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y=Ty rosin e, 
X=Un known, ""—Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














WGGPKIFVLRRVLSFFIRGEKKNTFGDF 
L KKT F F WGGE KLGKP P P KKKT P WGKKK I 
FKGERGKKPPFLGGKKK 


9053 


22954 


A 


9120 


405 


93 


EFSPLLLTTNQKRGFFPPSPPKIFFFPK 
RIYFLGGGGPHFPPPQKRGFPQKNQKGF 
FFSPPKEKKYNLPPPGKFCPPQGFFFPP 
PPLFFFSFFPFFFFRCQIK 


9054 


22955 


A 


9121 


2 


380 


WDYRCPPPHPTNFCIFSGDGVSPHWPGW 
S WPQVI CLPQPPKVLLTPWVRDRTRDLE 
GGDLW YGACLGACLHLGPQAAS FGHG I L 
G PAS LLCC I PKHVHMDTLS PGVKLQS CD 
TGVALRVGEKREGS 


9055 


22956 


A 


9122 


2 


534 


DMIREQKIYHKYLAQRREEEKAQEKEFD 
RILE ED KAKKLAE KD KELRLE KE ARRQL 
VDEVMCTRKLQVQEKLQREAKEQEERAM 
EQKHIWESLKELNCEEKENFARRQRLAQ 
EYRKQLQMQIAYQQQSQEAEKEEKRREF 
E AG VAAN KMCLD KVQ E VL S THQ VLAQN I 
HPMPQGMPQ 


9056 


22957 


A 


9123 


389 


46 


GPNFLPPPIFGFFSPFSPLKFFFSPKTL 
IFLGGFFPIFPPPKKSFFPKNSPGVFFC 
PPFKKKNFLFPSPLNFAPPRVFFKPPPP 
FFFFFFFFFFFFNFYRLYFIWWGLAKV 

vw 


9057 


22958 


A 


9124 


422 


59 


AQQKKKKKKKKKKKKKKKKKKKKKKKKK 

K 


9058 


22959 


A 


9125 


333 


168 


KGPP C I WCMVEFHKLEAGSHS VAQAGA 
QWCDGNLLQTLTLGLKRSPHFSLPSR 


9059 


22960 


A 


9126 


175 


89 


KPPPHFFFFFFFFFFFFYFFFFFFFIFC 


9060 


22961 


A 


9127 


368 


250 


CFNYLSRDRGLPLLPGLVLKSWPQAILL 
PQPPKMVGVQG 


9061 


22962 


A 


9128 


392 


157 


FFSPKRLNFGGGGGPKMPPPKKKVSPQK 
PHPGFFSPPQKKKKKFFPPPVKMGPPKS 
FLKSPPPFFFFFLISSIRGLFL 


9062 


22963 


A 


9129 


396 


13 


FPPPPPPIFFFSPRRFFFRGGGSVALSP 
PPKVFFLLNPPPVFFFSPPFKEKTSFFP 
PPFFCPPPQILFIiAPPPYFFFFFFFFLF 
VFFFLFFFFLLPLLFLGKTSFLLLLFLF 
I YLE TGS CS VAWARVQ 


9063 


22964 


A 


9130 


55 


180 


Lli YNKLt I FTLKKKKKKKKKKKKKKKKKK 
KKKKKEKYEKTKAY 


9064 


22965 


A 


9131 


2 


95 


S FVHMEHE KAVLLLKS FQNTVDLGI QRE 
LTV 


9065 


22966 


A 


9132 


172 


404 


KKKKKKNKKKKKKKKKKKKKKKNWGGGF 
FKN S WGGPKKRGGGK I FFFS LLGGKKKT 
LGVFLEKPLFLGGRKMAPPP P 


9066 


22967 


A 


9133 


1 


125 


QPVSYPFTI IGQVAYELYFTTILILIPT 
MS Li I ENK I L KWA Y 


9067 


22968 


A 


9134 


1 


120 


QPVSYPFTI IGQVAS VLYFTTILILIPT 
I S L I ENKI LKWA 


9Uo5 


22969 


A 


9135 


7 


269 


IAI GQPKLy rLL 1 LDP 1 LLLKRFLLTKKK 
KKKKKKKKKKKKKKKKKKKKKWGGGGHK 
KKPPGGPKTYPVKKKIFFFFKGKIKKPP 
WGI 


9069 


22970 


A 


9136 


1 


64 


LVS I SNT YGLDYVHNLNLLQ C 


9070 


22971 


A 


9137 


400 


259 


SQPLRRLRHKNILNPGGGHYSELGSCHC 
TPAWDTEQDSIHSWLLSSV 


9071 


22972 


A 


9138 


3 


310 


ECFPNSPNMNGHCNLLVQSTSPS ISTMP 
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Predict- 
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nucle- 
otide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R==Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CPKYVALEKVFLLNVLWFFYLKYRDNRP 
GSVAHTCNPSTLGGRGGWITGAQEFETS 
LGNIARPSLIKRKKYKLF 


9072 


22973 


A 


9139 


34 


142 


NCKKLNTHTHTLTHSHTHMLTHSDTFTH 
THTHTKYK 


9073 


22974 


A 


9140 


146 


2 


PPPPPPEGVSLCGPGWSAVARFRLTASS 
ASRVHAILPPQPPQQLPTRP 


9074 


22975 


A 


9141 


3 


102 


PPASASQSAGVTGMSHHAQPIEPLRCSS 
HRTT 


9075 


22976 


A 


9142 


245 


51 


NREGDLLCCPRWSQTFGLKQSCLGFPKC 
WEYRRGPPHLAYIMSMRKNILIQMIFYY 
NNPCLLYS 


9076 


22977 


A 


9143 


2 


108 


GAKCVSDRIKHAFLIKEQKI IVKVLKAQ 
AGSQKAK 


9077 


22978 


A 


9144 


3 


200 


MGFLHVGQAGLKLPTSGDLPASASQSAG 
ITGVSHRAWPVSGFFCSEAKECFWEEQL 
TNLDKNSQK 


9078 


22979 


A 


9145 


405 


227 


WFFLGFPLKGKLGPPKMFFRAPRVYSPF 
FPPFFFWPPPPRI IFPPGFFKFFVFLAG 
PIW 


9079 


22980 


A 


9146 


387 


220 


PLFFVGCFFFFFFFFYYFFFFFFFFFFF 
FFFFFFFFFFFLGGCERNFI IRKNINN 


9080 


22981 


A 


9147 


539 


220 


VINFKHILIKTVIQSLFFFERGSCLPPR 
L»E CNS M I MAH C S LNLLGS S HP P ASAS Q V 
AGTTGAYQHTQL I FIFFAETGS CYVAQA 
GLQLLGSSNPPPWPPEVLGLQV 


9081 


22982 


A 


9148 


3 


141 


GGGGCCEPRWSHYTPAWATSAKLCL.KKK 
KKKRFLEKSCLTPIiliKA 


9082 


22983 


A 


9149 


413 


240 


FIFCRDGGLTMLPGLTSNTWSQVILPPQ 
PPKVLGLQAYHADELPHLATWGVLYGLT 
V 


9083 


22984 


A 


9150 


395 


193 


LLFGP CLLNL I TQ FVSCHLLD I KLQMTL 
SEGYF PLNI QE S P F YRAPLDD P S VRHAR 
GEILPISLGPG 


9084 


22985 


A 


9151 


1 


77 


LVAAIVLGGTGWGFKNIRRGRSGGA 


9085 


22986 


A 


9152 


365 


181 


I VDRPD P CRDADAQREVFTLS ARYP K I W 
VPPGDPAREFP F I QSAPVMLIiAAPDELV 
SAQLA 


9086 


22987 


A 


9153 


373 


251 


FKKI FFCRDRALNMSPRLVSNSRTQAI L 
PPQPLKVLRLQA 


9087 


22988 


A 


9154 


204 


95 


DYHWSQPGMVAHACNPSTLRGRGRWITR 
SGDRDHPG 


9088 


22989 


A 


9155 


44 


280 


AELFGTLMREDGLSLGGGGCKEPKSCHC 
NPAWATEPDCLRKKRKESPYLLWTPPD 
SS YS PFFFLLLPAPDNWDYFLS 


9089 


22990 


A 


9156 


2 


117 


ERLRHENCLNLKGRGCSEPGWCHCTPAW 
ETKQDSISKK 


9090 


22991 


A 


9157 


3 


207 


HASAHAS ADAWD YRRVP P CPANF C I L YV 
SHVGKAGLNLLTSSDLPTSASQSAGITG 
VSHHARTPTYV 


9091 


22992 


A 


9158 


117 


267 


NRVLLCCPDWPQTPGLKGSSLSLLSSWD 
YRHI P PRLAARS S YLGPGFYH 


9092 


22993 


A 


9159 


415 


167 


LFFFP I KNXFFFS PPLFFFFFFFPPPPX 
XXFFFFFFPFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFLELKQNF 


9093 


22994 


A 


9160 


217 


41 


S SQ IHRDRNRMVRQGAVAHACNP S TFGG 
KGEQITRAQEFKTSLANMVKPRLYYKPC 
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nucleotide 
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ding to 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine 3 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va)ine, 
W=Tryptophan, Y=Tyrosine> 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LY 


9094 


22995 


A 


9161 


341 


164 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFYVCF 
VSVELKKWQNYSFDWIILRVEP 


9095 


22996 


A 


9162 


190 


402 


I HTGKHP KC YRN S YS VLFL AS KKKRKKK 
KKKKKKKGGGFKKKKNLNRGGGEIIFFF 
WGPKKI FGGGGLKKG 


9096 


22997 


A 


9163 


108 


209 


LS VKRCAKNPKKGKVAPG I KE I RTRSHQ 
DSEDI 


9097 


22998 


A 


9164 


2 


145 


KHCGRPRPQGPLPRKTRDLCSLVYLLTF 
PPLLS YDPAKS PSARNTQE 


9098 


22999 


A 


9165 


170 


402 


ISRCHFPSLILSYVKITKTIKAPKKSKK 
KKKKKKKKKKKKKKKKKWGGGPFKKKNF 
LGGGGGE I FF FLGAP KKKRGG 


9099 


23000 


A 


9166 


399 


113 


WRLRWEDCLSLGGRGCSEPGS C YCTPMW 
VTEQDQSSGYEGFQRPSSLSKARIPSVA 
AERSMCQPACVCNLS VEHLSLYS CLLAQ 
QGKALYKLLNK 


9100 


23001 


A 


9167 


3 


138 


S WI TG VCHYAWL I FAFL.VE TGFHRVGQA 
DLELVTSGDATTLRSH 


9101 


23002 


A 


9168 


134 


301 


EKLTQSHSVTQAGVQRLNVHSLRPPPPR 
FKRFSCLSLPSSWDYRNLETVLQEETH 


9102 


23003 


A 


9169 


201 


1 


GAT I LRFKTKKKNVF FKKKKKKKKRKKK 
GKGWSGTESRWNLSGEGSSGLRRRKLQA 
PAGPQEFEKLiA 


9103 


23004 


A 


9170 


2 


191 


FVPQPLPLGFKRFSCLSLLSGWDYRCTP 
PHLANFCGFFFLKFFFFFFKKKNFFSPR 
GRGGALI 


9104 


23005 


A 


9171 


2 


191 


GGTE S C F VAQAAVQGCDLGS LQSPPTGF 
SETASQTVDPDKVKDPGKLRTMFVAELP 
KTIVTDS 


9105 


23006 


A 


9172 


3 


143 


SWFLYVGQAGLELLTSGDPPALASHSAG 
I TGVSHRARLQTG I FRAV 


9106 


23007 


A 


9173 


361 


174 


VLGIILSMSETEIFNTKCLDQSGTVAHT 
YNPSTLGGQGGRVIRAQEFETSLGWTGR 
PHLFKI 


9107 


23008 


A 


9174 


493 


351 


LVETGFNHVGQAGLELLTSSDPPASASQ 
SAG I TG VS HLAL VL I S RKS C 


9108 


23009 


A 


9175 


1 


115 


FRKIWYIYTMEYCIAVKKKEMLSFTITW 
MNLEDFMLVR 


9109 


23010 


A 


9176 


286 


96 


PLFFFNRWGLAICLGWSAVAIHRRDHSA 
LQSQTPGLKRSSCLSLLSSWDYRRVPPC 
LANE SHY 


9110 


23011 


A 


9177 


399 


237 


RNTKWLFFLRRSLTLSTGIiECSGAISAH 
CNLRLPGS SDSLANGI FKDLALLDM 


9111 


23012 


A 


9178 


427 


101 


GRTGRQ GRG APH I SHDGRPGRDAPRFPD 
VMAAGKRRS SLPRWDGGRAETLLTFQTG 
QPGRGAPHI PDDGRPGGDAPHFPDGWA 
GQRLQS QHFGRPRQAAGRWRL 


9112 


23013 


A 


9179 


193 


41 


I VFLF I YL FCFFVGTGP CYVAQAGLKLL 
VS SGPLASAPQHVG I TGMSHHT 


9113 


23014 


A 


9180 


292 


119 


KI TKINKQ I LNLS YTLDQMDLTD I CRI F 
HPTATEYRFFSRAHGTFFRSGTKNSAWH 
I 


9114 


23015 


A 


9181 


3 


279 


ALSPPYSSECCKLGSFNRELSSPSPGGQ 
KSEIKVSAGPRPLPRVQGDPSPPLPASG 
GPRCSWACGQ I TAVS ASGVTHPS P PLIjH 
VCCVCLI 
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nucleotide 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
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ding to 
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peptide 

sequence 


Amino acid sequence (A=A!anine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine ? M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9115 


23016 


A 


9182 


402 


238 


QMQWHNLWPLQSLPPEVTPLFCLNLPNN 
WGYRWPPPPPTNFFFLKGVSLGRPGSV 


9116 


23017 


A 


9183 


3 


207 


EFHHVGQDGLELLTSGDPPASASESAGI 
TGVSHRAQQACRFLHFFEGTCFLSQELS 
YLKLGRDTIIC 


9117 


23018 


A 


9184 


3 


106 


YPHYVKSIVASTFIISLFPTTIFMCLDQ 
EVIISN 


9118 


23019 


A 


9185 


110 


248 


GVLVGSPLRTRFHSQLQAEGAEEIGWT 
WMGDGVLPVCVVLEVDV 


9119 


23020 


A 


9186 


925 


1132 


KCPS AWAAHPTS E I R I SQGGS QAAWL V 
LFFLLKNPGDVI PALWEAEAGGSQGQQF 
ETSLANMAKPRLY 


9120 


23021 


A 


9187 


399 


292 


ASQVAGTTGAHHHARLIFFVFLVETGFH 
HVSQDENF 


9121 


23022 


A 


9188 


7 


82 


PCDPPASVSQSAGITGVSHHAWQIF 


9122 


23023 


A 


9189 


1 


117 


LPISLPSFLPSFLPSLLPSFLPSSFLPF 
FFPSSVTPYLK 


9123 


23024 


A 


9190 


238 


46 


NKAFRI KAFVS PSNKKFSS FFFFLNEMK 
SHYVDQAGLELLSSGNPS ILASQNAKI T 
GVSHHVRP 


9124 


23025 


A 


9191 


2 


282 


LIFYFVDNMLVRAGLELLSSSDLPALAP 
KVLRFTGVSHHTRP EL YFQRLL APGTLL 
INSQAAGTNFPIGLSRIKPTQSGHGMLQ 
EVWGTSGLR 


9125 


23026 


A 


9192 


44 


136 


CLCVCVCVCVCVCVS CKQTLMCCFLENF 
DV 


9126 


23027 


A 


9193 


469 


347 


PCPANFVFLVETGFLHVGHAGLKLLISG 
DLPNSDQKFYVN 


9127 


23028 


A 


9194 


427 


238 


P PARPTAF P YIE F FVEMVFRYFGKAGLK 
VLISGDLCASASPGVGITGVSHHAQPK 


9128 


23029 


A 


9195 


34 


178 


SLVHWEFERGRAEWLTPVI PALWEAEA 
GGSRGQEMETILANTDFWTG 


9129 


23030 


A 


9196 


22 


338 


CGPYSMWWNS IKRERKEGKKRKKKRRRK 
EQKERERKERKERREGGREGKERGRREG 
KKRKKNKRKKVGLIGSPCIVFTWQAAYC 
EESGGVWPGGETEHTQEGLVQ 


9130 


23031 


A 


9197 


3 


91 


QGLS LGGGG CS E P G S CHCT P AWATE RD S 
I 


9131 


23032 


A 


9198 


1 


203 


RTRGNKS IDDTSNFDDFPESDILQPVPN 
TTEPDYKSKDWVFLNYTYKRFEGLTQRG 
SIPTYMKAGKL 


9132 


23033 


A 


9199 


2 


144 


PTRTMPCCCCCCCCCCCCCRCLFMTGVL 
FTCCHNQFENHSGCLPRQI 


9133 


23034 


A 


9200 


1 


99 


YAAKE L VHDR YRP T I E ME RRRGL W WL VP 
RLSLE 


9134 


23035 


A 


9201 


3 


149 


LNAVQRMEWQLKVQELDPAGHKSLCVNE 
VPS F YVPMVDVNDDFVLLPA 


9135 


23036 


A 


9202 


284 


21 


KG P PL F F F F F FLRQ GLVLWHRLE YS GMT 
TAHCNLE LLVS SDPPTLSS QNAE T TGVS 
HCAQPASLILSKPYIGFLHQIQKSCLPV 
TFL 


9136 


23037 


A 


9203 - 


1 


77 


LRLVLNS WRQAILLL WPPKVLGLQA 


9137 


23038 


A 


9204 


529 


340 


GVLLWRQAEAGELSLGGGGCSELKVRHC 
TP AWATRAE T PS QK I NE L I KS MAVE LTD 
DFIT 


9138 


23039 


A 


9205 


413 


241 


HPPPPELKLFSCISLISRWGHRHRPPCP 
YVFVLLAETGFHVGQAGHKHTYCCVSLP 
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residue of 

peptide 

sequence 


KNLysine, L==Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














T 


9139 


23040 


A 


9206 


33 


152 


QLQGLE S WGMERCALP6YP WFTNLCKY 
RSWIEETMRDK 


9140 


23041 


A 


9207 


70 


402 


HVDWE ALGND P RLVNLGI C KL KAS G IHE 
WVHTRTCAHTHMHTHTHTCTHCRS PRW 
VLNLGCTFELPGALKNARAYPHPNATKP 
MTSESVGIRPTRQVFKAPRAEQHCSTA 


9141 


23042 


A 


9208 


46 


262 


GG I P P AAVRGGRS WLRD VL S RLAAAGRR 
RRGTRRGRGRAGGAGGGRDRVGS RREPA 
AARTAAEGERAVRGKC 


9142 


23043 


A 


9209 


15 


206 


PIEPLSERTNSAGAPPPASLPPCSLISD 
CCASNQRDSVGVGPSEPGDSDMQTSLWA 
KVPVSKD 


9143 


23044 


A 


9210 


475 


110 


NLLAPHLLVSLEKHS IRMGVAQFSRCHL 
SQLPLARKENSLTPCASGVRRCLALLRL 
MLGGLHPLSCTHCPTGPSEMNLVSQLEM 
QKSPVFCVAHAGSCRLELFLFGHLPL 


9144 


23045 


A 


9211 


420 


132 


PPPPNYFGGAGFFPPPPFFLRPPPPFFF 
LPPKKKKIFPPRGPKKIFFIiKAPPPSFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
LVRSHFILTIM 


9145 


23046 


A 


9212 


353 


87 


FFFXKQXFFFPSLFFFSPPXXFSFYPF* 
YLKYPLLPPSFFLFFFTPPPPXFFFFFF 
FFFLFPPPFFFFFFFFFFFFFFFFFFFF 
FFFFD 


9146 


23047 


A 


9213 


2 


276 


F I QLLKRRLiCE LG YE VL PHP S Y S HDLL P 
TNYHFFKHLDNFLQRKFFHNQQDAESAF 
QEFAESQSMEFYATGINTLISPWQKCVD 
CSVSYFD 


9147 


23048 


A 


9214 


402 


276 


LSCLSLPSSWDYRCPPPKPGYFFYFFLF 
LVKTGFPHLAQAG 


9148 


23049 


A 


9215 


386 


37 


GFKKRVCSPFFFFFFFFFFFFFFFGREG 
FFLQKKKVTCNNPS SNKLLTLGLQGKKE 
IQPAIiMKKNSISQS 


9149 


23050 


A 


9216 


1 


142 


KDFRVQVWYNHSSLQPQPPGFKQSSRLS 
LPKYWD YRHE YLHC I QLAP 


9150 


23051 


A 


9217 


402 


181 


FFFRXFFFFXXFFXXXXFFFFFXXXXXX 
XXXXXXXXXXFFFFXFFFFFFFFFFFXM 
FQQEKLLFIFYVNIWTF 


9151 


23052 


A 


9218 


2 


87 


SGSLIFGLIYICMCVCVCVCVCVCVYPC 


9152 


23053 


A 


9219 


420 


271 


YYC I FRE TGFHHVGQAGLELLTSGE PPA 
SASQSAGITATAPSPFLTFLLV 


9153 


23054 


A 


9220 


139 


413 


KKKKKKKKKKKKKKKKKKKRGGGAFKKK 
IFFRGGGGNFFFFLGPQKNIGGGGLKKR 
GGGKKPPPPQIFFFVG 


9154 


23055 


A 


9221 


134 


236 


LHRWGPAMLPRI ILNTWPQVILLPRPPK 
VLELQA 


9155 


23056 


A 


9222 


364 


280 


S THL S L P KYWD YRHE P P CLAKKDAQ S F S 


9156 


23057 


A 


9223 


1 


147 


TLLPMLESSGTIMAHCSLNYLGSSDPST 
SASQSARITSVSHCTQPPLPF 


9157 


23058 


A 


9224 


1 


208 


AGAPPPASLPPCSLISDCCASNQRDSVG 
VGPS E PGAGYHL WRRLSQS E KRS IRVG 
VTRFSSLLLSSHA 


9158 


23059 


A 


9225 


1 


145 


FQHVGQAGPKLLTSGDLPASASQSAGIT 
VSNRVRP KTKNF YTGNTRPN 


9159 


23060 


A 


9226 


396 


210 


KCWD YRHE P PYPALTNLLF I EMGS YGVA 
RAGLKLLGS SNPP I S ASQSAGVPGLSHH 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C~ Cysteine, D=Aspartic Acid, 
E-Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQPIF 


9160 


23061 


A 


9227 


2 


154 


FHHFGQAGLELLT S SDLPTS ASQSAE I T 
GMSHHTQPRMVFFSNFSSTAQC 


9161 


23062 


A 


9228 


1 


238 


FFLLALGIFILHAIYLSIYLSIHPPSMH 
PSLSIYLSIYLSIYLSIYLSISLSLSLS 
I YLS I YLSI YLS I YLSDFLSSAR 


9162 


23063 


A 


9229 


3 


65 


RGDL I TEPFLPKLLTKVKGS 


9163 


23064 


A 


9230 


25 


188 


YFRTAWNQAWWYVP I VSATWEAEAGGLL 
EPTSSRLRCAVIAPVNSYCTSTWAVS 


9164 


23065 


A 


9231 


44 


232 


YDLKFNNHVVFYFLFETEFHSLSKVECN 
GAISAHHNLCLCDGARLHHRRGGKKTKK 
QKTLPS 


9165 


23066 


A 


9232 


3 


195 


FFVGLGS P Y WQTGLELLCSGNPPASAS 
LNAR I TGMSHQVQPNTGRLS F YLGS S I Y 
GSIHAMI 


9166 


23067 


A 


9233 


380 


244 


S VPLAEMGFHHVGQAS PELLP P VIHS S Q 
PPKAIGLQARATAPSLH 


9167 


23068 


A 


9234 


11 


101 


LLLINTLMFIHLSPILLLSLNPDIITGF 
EG 


9168 


23069 


A 


9235 


411 


226 


FFFFFATNSQGGPTSFWGPRNNAQVAGP 
LRNKIFTPVGSPKKIFPSWETPVF 


9169 


23070 


A 


9236 


404 


262 


QGLAMLPRLVSNSWTQAILLPWPPNQSA 
GGS GVSHHTQ P I F I S EN I H 


9170 


23071 


A 


9237 


369 


152 


HAFFVS IWRSFHVSNSYMTVTVQMYHNL 
S I PLPDPHFFFRGGGLAMLPRLVSNPWP 
QAILPPQPPKVLGLQA 


9171 


23072 


A 


9238 


389 


190 


RPKKKKKFFPPPGGKGVFFKGAPPPYFL 
NF1 FFFFFFFFFFFFFFFL I YTHNFL I L 
FLNCDKLYIT 


9172 


23073 


A 


9239 


330 


3 


GGMEPPFFPPTGAQWGYFGSLQNSPPEF 
KQFL CLRLPNVLE YKS IPP CPTNFCS F Y 
KNGIFPCWPGWPFKFQKKKKISWVWWCA 
LAEAGGSLEPRWRWRLQSAKMAPL 


9173 


23074 


A 


9240 


122 


248 


KKKKKRGGLTPVI PPLWEAKGGGSRGQK 
IQTILAGGGKAPLY 


9174 


23075 


A 


9241 


2 


286 


F F VQT E FHH VAQAALQ LVS S RD P P TS AS 
QSVDITGVSHCAQPVIVSCDDQVTLVQR 
RQVTVTS L I AF CQVAYPKGAKS QWPQS F 
FFSSPFLFSL 


9175 


23076 


A 


9242 


2 


95 


HMKKCSSSLAIREMLIKTTMRFFFFKLI 
TCY 


9176 


23077 


A 


9243 


377 


293 


PLSPRLVLSSWAQMILLPLPPKVLGSQA 


9177 


23078 


A 


9244 


422 


147 


KKGGTQGFSPPRVFKRGPRNFFFGPPKK 
KKNPPPPRGKIFFFLKGPPSFFFFFFFF 
YFFFFFFFFFFTHKYFLLGDLLLIYHLS 
KKNYSFF 


9178 


23079 


A 


9245 


402 


208 


T IGRCHHTQLNLKFFVEMGSHYWQAGL 
KLIiDS TNP PS S AFQS AGI S GVNP YAWTE 
IMFLYSIPI 


9179 


23080 


A 


9246 


314 


168 


EPYHLHFLPPHHYYYCRWGLIVLPMLVS 
NS WAQVI LLP WP P KVQGLQ V 


9180 


23081 


A 


9247 


3 


320 


WQAGLKLLSS CDQ PAS AS QSAVITGVN 
HHARPLIFLYEMDYGSRTVLI IRKWDDN 
EQSNRT I FLKKDELTGNRSACLKYTPMD 
INVNSGLGWRYRFGMHHQCRW 


9181 


23082 


A 


9248 


22 


303 


KGSYFS FT YEAQTES CS I ARLE CRGAI S 
AHCNLRLLDSSNSPASASRVAETTDHKT 
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Amino acid sequence (A=Alanine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














LVPERSCFTPKERNVIQKGQEESEEMSN 
HLFTWLPILH 


9182 


23083 


A 


9249 


13 


174 


LGLKVRHRL WL I FVLS VE TE FHHLGQAG 
LELLTSGNPS AG I TGVSHCARPTSHS 


9183 


23084 


A 


9250 


3 


141 


HVGQTVDRLLTSGDPPTSASESAGITGV 
SHRPRQIFISLFTICIR 


9184 


23085 


A 


9251 


56 


262 


LRIPLTFGGWKGEEEDRAVMHLRLCGD 
DSPMSESLHFIKLPRTVLVDEDARPEKK 
KKKKKKKKKKKKK 


9185 


23086 


A 


9252 


401 


275 


PADLF I F I VJEDGGLTMS PRLVMNSWSQA 
TLQPWTPKVLGLQV 


9186 


23087 


A 


9253 


411 


258 


PKVFKYRCDPPRLAWFLFFVQSLLPRLG 
LNS WAQAFL P P W P P RVLGLQ A 


9187 


23088 


A 


9255 


391 


167 


GLKILTSGDLPASASQSAGNTGMSHCAR 
PLMS FKIRYFLFT I KMKLKLLWPVWS A 
GVHPPHFSCPIVITNKPRI 


9188 


23089 


A 


9256 


9 


255 


F F CRYGGLAVL P RLj VSNS WTQ A I L S P W P 
LKVLGLQACTHHFFLiVIVNSNAARIKTIE 
MLLSEDVNFFVALLCPSFTISHFHL 


9189 


23090 


A 


9257 


487 


42 


RYF I FLFYF I FKTETGS YYVAQARLRLL 
GSSNPPALAFQSARITAVSHRAQLTCIK 
YTFLFTHIP 


9190 


23091 


A 


9258 


1 


243 


ILSSGGSFISLTAVILIIFMIGEAFASK 
RKVL I VE EPS INLE WL YG C P P P YHT FEE 
PQPKKSEIYSKCFFFSVFVHFGGWG 


9191 


23092 


A 


9259 


411 


2 


FFFFSPPVSLSLCVLESSPPPFFFFSPP 
PKKGFFSPPFFFFSPRVFPPPVFFFPPP 
PFFFFPPLKKNFSPPPPGKIFFFFFRPP 
PFFFFFFFLFFFFFRPFFFFSFFFLRWE 
EVLYF IMCHHI SDAYPPNVDMNKE 


9192 


23093 


A 


9260 


417 


219 


SVEMGFLHVGQAS IKLPTSGDPSASGSQ 
SAGITGLSHCARPHSQLS IVCCYSVPVQ 
CLHSLAARY 


9193 


23094 


A 


9261 


408 


172 


C PNH FRKWGHRG C P P PRVF F F VGRDG V Y 
VAQAGLELLDLSNS PASASQSNGTTGMS 
HLGQPLLFILNDVTAVIPLQGSF 


9194 


23095 


A 


9262 


401 


132 


HQDNKPAPPGFFFFSPPPKKGFFSKPFY 
FVSPRFFPFPPFLNPAPLFYFWGPKKKI 
FFFPPPAIKFFFFKRAPPLFFFFFFFFF 


9195 


23096 


A 


9263 


393 


220 


LFFLNKIPFFPPGGSKGGDFGPHTNPPP 
GVKPIFRPHPPEEGGPKGPPPRPREFFL 
GF 


9196 


23097 


A 


9264 


206 


438 


FP I L KTHFRLGT VAHT YNPTTLGGPGGR 
NTCSKFKTSLGTWGDPFFHKSENCLGWG 
VPNCGPCFWGAQQKSAWSEGL 


9197 


23098 


A 


9265 


333 


48 


ALLAGPTPFFFLGKKKTRFGNRALFFFF 
FFFLNE VLrLCNP GWIMWA I HR CDHGIh 
QPPTPGLKRSS CLS LS SSWNYKLKPLHP 
AGVRFLDFVS 


9198 


23099 


A 


9266 


3 


275 


TGFHHIGQAGLKLIiTLGDP PLSASHSAG 
I TGVSHHAWPMHLES YWRVQYPFMPQVL 
YFWAPSGAQHGAQI IENAIPTLLSRPQE 
EPASEA 


9199 


23100 


A 


9267 


49 


226 


HGPEPCMDADAQREGINVSARYPKNWES 
TVDPAREFTMIYSAPLMLLADPDEFASV 
QliA 


9200 


23101 


A 


9268 


159 


236 


PRLVL I S WPQAI L P P WP IK WGLQA 


9201 


23102 


A 


9269 


1 


208 


I FN YC VF S S GVALAYG I YKQD L P ALKE K 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, ""—Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














PRGWFVDMGHSAYQVSACAFNRGQLKV 
SIYFFSRRLLWW 


9202 


23103 


A 


9270 


1 


209 


IFNFCVFSSWALAYGIYKQDLGALEEK 
PRNWFVDMGHSAYQVSVCAFNRGKLKV 
SIYFFSRRLLWW 


9203 


23104 


A 


9271 


379 


263 


ESCSVGRLECSGAVPAHCNLCLCLPDSS 
GSPASSSRLC 


9204 


23105 


A 


9272 


312 


3 


FRREGLAALQPWWPNKVSSKKKRKKERE 
RKKERKKERKKERKKEDGATADGISEAE 
VIRVHPYPVGFGVLSEGDIWTEDMRAGR 
RPHAGESGDTVMLPQAQE 


9205 


23106 


A 


9273 


405 


246 


PTKKNKYLHAPPLFHFFFIIPNPNFFFF 
FFFFFFNFFFFFKTKGCEFLYFSY 


9206 


23107 


A 


9274 


2 


227 


LVETGFRHIAQAGLQLLSSSYPPTSASQ 
SVGITGVSHQPGPSFHFERKSLGLKTLL 
LCVKVGVRACFALFFQGSS 


9207 


23108 


A 


9275 


70 


304 


NLFLKASTYPSEHLYHLPPTP I VYIRKR 
LDAMAHACNPGTLGDRGGWIARTPDLKC 
S AYRGL S KCWD YRHE PPRP VLV 


9208 


23109 


A 


9276 


2 


260 


T VF YHVGQAALE LL T S GGP PALAS E S AG 
I TGVS HHAQPFLL I L VI S L FP CNQ S FHI 
SPYRFSFTQLNHGFNTVLYPLVINISEI 
FF 


9209 


23110 


A 


9277 


406 


330 


PRLVLNSWTQTILLPWPPKVLGLDV 


9210 


23111 


A 


9278 


2 


160 


ELLTS SDPPASASQSARITGMSHHTRLK 
YIVYRVSRKS KNI PLKHIG IQS FP 


9211 


23112 


A 


9279 


1 


91 


GLTVFPRLVWNSWAQMILPPWPPKMLGL 
QV 


9212 


23113 


A 


9280 


3 


107 


LGETGFHHVGRAGLELLNKVMHPPWAPR 
VLGLQA 


9213 


23114 


A 


9281 


336 


234 


AHYKLLLKI KLPGQAQWLTP VI PTLWEA 
EAGRS 


9214 


23115 


A 


9282 


3 


241 


DRPE P CWDADAQREG INASARYPKNWVT 
TGD PAREFTMIHTAPLMLLADPDE I ES V 
QLAQSWPFGAIASLGRSPCRKRN 


9215 


23116 


A 


9283 


404 


277 


IFFVETRSHHIAQAGLELLGSSDPPYFF 
QVFTEMSLSQRALF 


9216 


23117 


A 


9284 


399 


231 


H Y VAQAS L KLL YS GD P PAS AFQ S AE I AG 
VSHCTWPTGSLNKIQNNIHSRISSM 


9217 


23118 


A 


9286 


394 


231 


AFFFFFFFGERGFHHVARTVLKLLDSSN 
PPAWSQSAGITDVSHDFLISLYRIY 


9218 


23119 


A 


9287 


420 


311 


LNFCRDGLAMLPRLVLNSWAQAVLSPQP 
PKVMGLHA 


9219 


23120 


A 


9288 


398 


173 


LLSFFYLFFYFFFFILFFLFFFFFFFFF 
LMFKALLFS CHYFPAMVTFKFSLS I SGV 
VRTSGYIMLFSYLCLLKTM 


9220 


23121 


A 


9289 


270 


23 


TPEQCRGIKLFIYSLILYVRDGVLLCCS 
GWPWAIHKCDHSTLQHQIPSLKQASCL 
SLLSSWEYRYAPLHLASANFLKVTW 


9221 


23122 


A 


9290 


273 


46 


HLLNMPLFTEIWTERLFLFYILCFVFLV 
ETGFFYITQAGLKLLTSSDPPAYASQSA 
EIIGTSHCTRPICYVNVTNI 


9222 


23123 


A 


9291 


424 


230 


CLPFTHSLSLSLSLSLSHTHTHTHTHTH 
TIYMNK 


9223 


23124 


A 


9292 


2 


146 


NLFLVETRSHHVARASFKLLSSSNPPTS 
ASQSSEITGASHRTWPEVFF 


9224 


23125 


A 


9293 


393 


202 


GRERERKREREREKERKKERKKERKPER 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
1 — threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X—Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














T\ C? O T rTTVTT^T 7~0 TP T TP 7\ 7\ ATT /" ~1 ~X 7~ 7 ~> T~i 7 T T7~~T\ €~ I T/"m 

PbGoJjl xCvKiiGJ. 1 J\ANhGKKRHKJ\SKT 
LISHKNKN 


9225 


23126 


A 


9294 


252 


193 


GFCCCCCCCYCCCSDKRHPG 


9226 


23127 


A 


9295 


409 


269 


SQYFGRLRRVDCLS SGVQDQPGQQGETP 
SLLKYKIYIHISWAGWHV 


9227 


23128 


A 


9296 


2 


213 


FCCGLLIHLIYCKTEYSGQSTVAIHRCN 
HSVLQPQTLGFKQFSHLSLLSSWDYRYS 
PLCPAYIFNLSIGY 


9228 


23129 


A 


9297 


169 


408 


RFARHMISTS PHEERL P PPAG I RVRP PPQP 
AASLGPPRMAPSISRRFQMTPYYKHNDF 
AGQAQWLMP WPAL WEAEAGGS S E 


9229 


23130 


A 


9298 


199 


51 


LHP PPPSLtSLSLS HTHTHTHS HTHTHTH 
TLWLGRPGPAQDWLVWGNLT 


9230 


23131 


A 


9299 


1 


176 


MGSHSLAQARMQWCNQSSLQPQPPGLKQ 
PSPFSLPSSWDYRHVPPCLASIFFLKLN 
YH 


9231 


23132 


A 


9300 


412 


209 


MGFHHVGQSGLELLTSGDPPTSASQSAG 
I TGVSHRNWPHCGFNLVFLMPDDVKHLF 
MGILAICVSSL 


9232 


23133 


A 


9301 


195 


347 


K I KG FRP GATAKAGNP S ALGHQGGRTA I 
AWEFTISMSNMGRPCLYRKYKN 


9233 


23134 


A 


9302 


386 


190 


GFLFFFPPPKKEVFPHPFILGFPGFFPP 
PPFFKRAPEFFFWGPKKKKNSPPPRALN 
FFFFKGAPP 


9234 


23135 


A 


9303 


27 


173 


S FQLWGHLKVPRIRPKRADHLRSGVQDQ 
P D QHGKTLS LLKKYK I S RAW 


9235 


23136 


A 


9304 


374 


121 


QWCDHGSLQPQTPGLKQSSCFSLPKRWD 
YRREPPHPTSLSPYIVAWPCTLCPPPLS 
ALWKHL I HQ KCD G VM VTjQ VP GRTHNVTL 
T 


9236 


23137 


A 


9305 


70 


196 


PQVIHPGQAGLELLTSGDLPASASKSAG 
I TGVSHHAQ PISHF 


9237 


23138 


A 


9306 


378 


256 


QKWVS I CCPGWSQTFGLQES SCLGIjPKS 
WDYRRQPQPCLTL 


9238 


23139 


A 


9307 


206 


3 


PCHLGWSAWQSWLTAASNSWPHAILPP 
QPPKVLGLQASLKFLKAGITQVTFSDYN 
KILNKEIKRGR 


9239 


23140 


A 


9308 


338 


161 


HFKKKIYIYIYVYIHTYIYIYTHIHIYT 
HTYTYIYRYIHIYIHTYIYMYVYIYIPS 
LPF 


9240 


23141 


A 


9309 


503 


288 


FFLNLDKSQEWWLMPVIPTLREAEAGGS 
LEVRSSRPASSDPPTSASQSAGIIGMRH 
HARPWLRENGR I QN 


9241 


23142 


A 


9310 


418 


309 


FFCKNGISICCLPKLVSNSWPQAILQPQ 
PPKVQGTIi 


9242 


23143 


A 


9311 


442 


275 


VSPPPLGGGFLSPPFKPPPFFFNLKGGP 
PFPTIPFLIIKGKKFCPPPPFFFPGLF 


9243 


23144 


A 


9312 


2 


95 


HENRL S LGGGGC S E LRL CHCT PAWATER 
DFI 




z3145 


7\ 
£\ 


93 13 


318 


154 


TP TP TP £? TP TP TP TP TP TP TP TP TP TP TP TP TP TP TP 11' TP TP TP TP TP TP 1 . » ■ . i 

FFFFFFFFFFFFFFFFFFFFFFFFFFFI 
FIIiFIYSLLYWCVGVLGS 


9245 


23146 


A 


9314 


388 


199 


EQPKCVHRIKQIWHICALEYYAAFKKKK 
VLPYAVIQ I KLEDTMLS E INQS WGKYCM 
IPLIGGI 


9246 


23147 


A 


9315 


2 


97 


S T P GL KRS S H I S L P KYWD YRRGP PGP AS 
ILF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SJEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, 1= Is o leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


9247 


23148 


A 


9316 


3 


235 


TMWNAEGI P IRTTLDNSTTVQYACLLH 
HLTMKAKSTVRDIDPQNDLTFLRIRSKK 
HEIMVAPDKEYLLIVIQNPCE 


9248 


23149 


A 


9317 


253 


71 


LFFFFLFFVDTKFHHLAQDGFQLLSS SN 
S PTS ASQSGG I TGRSHQAWP YVHLLTLQ 
KLNY 


9249 


23150 


A 


9318 


425 


95 


CLLFCPKKKKKKKKKKKKKKKKKKKKKK 
KKKGG P I KKKKKQ P 


9250 


23151 


A 


9319 


25 


214 


WIDDRPEPCMDADAQREGINSSARYPK 
NWVTTGD PARE CGM I HSAPLMLLAD PDE 
FVSVQIA 


9251 


23152 


A 


9320 


50 


231 


GHR PEP GRDADAQRE G I IASARYPKNWV 
TTGD PARE CTR I QSAPLMLIiAD PDE FVS 
VQLA 


9252 


23153 


A 


9321 


401 


226 


Q WL FTNMI I GHCS LELLGP SNP P I S AS R 
LAATTGTRHHAQPS KFRNLKGWLCAKRA 
RT 


9253 


23154 


A 


9322 


410 


159 


LGSLQLPPPGSKRFSRLSLPSSWDYRGC 
SELRLRHCTPAWVTEQDSVPPKKKKQI F 
RPFPFSYGESLGVGLRHLCFSKPSRRF 


9254 


23155 


A 


9323 


310 


25 


KWGGLFFFSPPKKRGFFKQVFFCVPPVF 
FFPPFFKTGPGDIFLGPLKKKI IFPPPG 
GKKFFFKRGPPPFFFFFFFFFFFFFFFF 
FSEVECVCLCR 


9255 


23156 


A 


9324 


406 


289 


PGFYHAGQAGLKLLiTSGNPSASASQSAR 
ITSISQRTRPH 


9256 4 


23157 


A 


9325 


1 


116 


LHSVLLPFFYFCVCVCVCVCVCVCVKHK 
YVRAGGAPGR 


9257 


23158 


A 


9326 


2 


158 


F I FLFT VGGLTG I VIjANSS LD I VLHDT Y 
YWAHFHYVLS IGAVFAI IGGFIH 


9258 


23159 


A 


9327 


3 


107 


RGC S E PRLRH C T P AW VTEGD CVS KKKKK 
VPEAWG 


9259 


23160 


A 


9328 


257 


2 


GFFSNISASVIIPPPISRKNLFFPFRVY 
FGPPPKTFKGPPPFFFFFFLSPRL.ECSG 
VIMAQCSLTTiPGS SDLPLSAS CVAMTVS 
I 


9260 


23161 


A 


9329 


66 


384 


GNLCGWMKSMAWDNP WRKADWGQIRE S 
LECQPNTGMLKKKKKKKKKKKKKKKKKK 
KRGGPFLKI SWGGP I LGGRGKI FFFFFG 
GGILTPRGVFWKSPFFWGGKKK 


9261 


23162 


A 


9330 


1 


98 


IIGQVASVLYFTTILILIPTISLIENKI 
LKWA 


9262 


23163 


A 


9331 


33 


412 


NLFYVMAE ILHTHIQLCCQVKL.HLSVPR 
P S L P LLCCWNF S LVLHEHS Q PAS E KKKK 
KKKKNKKKKKKKKKKRGGVFKKKPGGAP 
FCGGGENFFFFFFGGGLKKPRGVFGKNT 
FFWGGEKRPPPPPY 


9263 


23164 


A 


9332 


401 


311 


PTRPVDLLVDDSVKESLIGYGPTEDSKE 
FS 


9264 


23165 


A 


9333 


5730 


452 


TRSAPRAASWLEDPREVRSVCLSATFFT 
LSLQKPRLLLFSPSWHLGVPLSVGVQIi 
QDVPRGQWKGSVFLRNPSRISnWPCSPK 
VDFTLSSERDFALLSLQVPLKDAKSCGL 
HQ LLRGPE VQL VAHS P WLKD S L S RTTN I 
QGINLLFSSRRGHLFLQTDQP IYNPGQR 
VRYRVFALDQKMRPSTDTITVMVENSHG 
LRVRKKE VYMPS S I FQDDF VI PDISEPG 
TWKISARFSDGLESNSSTQFEVKKYVLP 



1106 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A—Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=lrysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *-Stop codon, /=possit>Ie 
nucleotide deletion, \=possible 
nucleotide insertion 














NFEVKITPGKPYILTVPGHLDEMQLDIQ 
ARYIYGKPVQGVAYVRFGLLDEDGKKTF 
FRGLESQTKLVNGQSHI SLSKAEFQDAL 
EKLNMGITDLQGLRLYVAAAI IEYPGGE 
ME E AE LT S W YF VS SPFSLDLS KTKRHLV 
PGAP FLLQALVREMSGS PAS G I PVKVSA 
TVS S PGS VPEVQD I QQNTDGSGQVS I P I 
I XPQT I S ELQLS VS AGS PHPAIARL T VA 
APPSGGPGFLS IERPDSRPPRVGDTLNL 
NLRAVGSGATFSHYYYMILSRGQIVFMN 
REPKRTLTSVSVFVDHHLAPSFYFVAFY 
YHGDHPVANSLRVDVQAGACEGKLELSV 
DGAKQ YRNGE S VKLHLE TD S L ALi VALGA 
LDTAIiYAAGSKSHKPDNMGKVFEAMNSY 
DLGCGPGGGDSALQVFQAAGLAFSDGDQ 
WTLSRKRLSCPKEKTTRKKRNVNFQKAI 
NEKLGQ YAS PTAKRCCQDG VTRLPMMRS 
CEQRAARVQQLDCREPFLSCCQFAESLR 
KKSRDKGQAGLQRALEILQEEDLIDEDD 
I PVRS FFPENWLWRVETVDRFQ r LTLWIi 
PDSLTTWEIHGLSLSKTKGLCVATPVQL 
RVFREFHLHLRLPMSVRRFEQLELRPVL 
YNYLDKNLTVS VHVS PVEGLCLAGGGGL 
AQQVLVPAGSARPVAFSWPTAAAAVSL 
KWARGS FE F P VGDAVS KVLQ I E KEGAI 
HREELVYELNPLDHRGRTLE I PGNSDPN 
MIPDGDFNSYVRVTASDPLDTLGSEGAL 
SPGGVASLLRLPRGCGEQTMIYLAPTLA 
ASRYLDKTEQWSTLPPETKDHAVDLIQK 
GYMRIQQFRKADGSYAAWLSRDSSTWLT 
AFVL KVL S LAQE QVGGS P E KLQ E TSNWL 
LSQQQADGSFQDPCPVLDRSMQGGLVGN 
DETVALTAFVT I ALHHGLAVFQDEGAE P 
LKQRVEASISKANS FLGEKASAGLLGAH 
AAAI TAYALTLTKAP VDLLG VAHNNLMA 
MAQETGDNLYWGS VTGSQSNAVS PTPAP 
RNP S DPMPQAPALW I ETTAYALLHLLLH 
EGKAEMADQAAAWLTRQGS FQGGFRS TQ 
DTVI ALDALSAYW I ASHTTEERGL.NVTL 
SSTGRNGFKSHALQLNNRQIRGLEEELQ 
FSLGSKINVKVGGNSKGTLKVLRTYNVL 
DMKNTTCQDLQ I E VTVKGHVE YTMEANE 
DYEDYEYDELPAKDDPDAPLQPVTPLQIi 
FEGRRNRRRREAPKWEEQESRVHYTVC 
IWRNGKVGLSGMAIADVTLLSGFHALRA 
DLEKLTSLSDRYVSHFETEGPHVLLYFD 
S VPTSRE CVGFEAVQE VPVGLVQPASAT 
LYDYYNPERRCSVFYGAPSKSRLLATLC 
SAEVCQCAEGKCPRQRRALERGLQDEDG 
YRMKFACYYPRVEYGFQVKVLREDSRAA 
FRLFETKITQVLHFTKDVKAAANQMRNF 
LVRASCRLRLEPGKEYLIMGMGLDGATY 
DLEGHPQYLLDSNSWIEEMPSERLCRST 
RQRAACAQLNDFLQEYGTQGCQV 


9265 


23166 


A 


9334 


46 


324 


S KYL P P V I T FQHE I W VGTN I QT I S DWVL 
I GCPGWS AVTQSRLAAASTS WAQAMSS S 
S CLGLSSS C YYRHTPPCLANFVAFVETK 
LYTMLPGLS 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E—G\vtamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=~Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9266 


23167 


A 


9335 


338 


178 


EVDIAVSHRRKQAFNLCLLKCKPHNNCL 
NLEGGGSSELKLCYCIPAWATDQSL 


9267 


23168 


A 


9336 


396 


254 


KLS TDCTSKE I INRHPTEWEKI FANYAP 
NKGPTSTTHSSCLMFPYLF 


9268 


23169 


A 


9337 


454 


359 


SSWDYRHAPPCPANFVFLVETI FIQGE I 
GLL 


9269 


23170 


A 


9338 


540 


328 


I KKRGAP RRGVL F F F TRKKKF F S Y YKKS 
T KE RVS FL F VAPG C I Q GGKKKKP PT KKK 
KKKKKKKKKKKKKKK 


9270 


23171 


A 


9339 


21 


209 


AEE F P D F KLY YKANQNT YRGQ WNRTE T S 
EITPDIYLGIQLVSNSWPQVILLPWPPK 
VLGLQA 


9271 


23172 


A 


9340 


345 


2 


KIFFFLIYILFFFNFFFFFFFFFFFFFF 
FFFFFLENRWFLMENFIKGRGHVKGPLG 
Q ARDHD I S GRRWWRRE Q CRAS QL PGVRR 
S LLQ VQAGGP VH IRIS GLQDARQ GPRVA 
GA 


9272 


23173 


A 


9341 


15 


252 


LERVIRQSAALTDSFFNFILLILFIYLF 
FETGS PCVSQAGLKLLGSRDPPVSASQS 
ARIIGVSHRAQPALTDFPSISF 


9273 


23174 


A 


9342 


484 


276 


RHAPPCQANFFVQTRFCHVAQSGLKLIR 
S RYLFAS AFQS VR I T CMSHCARPLYFLM 
FKRLQYLKSPTKS 


9274 


23175 


A 


9343 


464 


326 


LVETGPHHVGQAGPELLAPCDPPAPASQ 
S AE TTGMSHRARP P ANFV 


9275 


23176 


A 


9344 


486 


280 


S FSHSHS CS LPS S WDYRCTS SHLANFFF 
KRWGFTILPRLILNSWPQVILLPQSHRH 
EPLCLETLKTLNV 


9276 


23177 


A 


9345 


276 


179 


DRDFTMLPRLVLNSWPKTILLPLPPKIL 
RLQS 


9277 


23178 


A 


9346 


45 


143 


RWNSRQENCLNLGGGDCSEPSLCHCPPA 
WATE 


9278 


23179 


A 


9347 


229 


73 


PPPHMIFFFFFFLFFFYFFFFFFFFFFF 
FFFFFFFFSRLQVSVYCSSVLNKNVD 


9279 


23180 


A 


9348 


391 


139 


I PSLKRS SHLS LAKCWD YRHKP P CQVT F 
S FLNVR I YATKFFLS TPLAAWS K YWYL V 
FLLSFSS KYYLTP I S FLFLTHNVS SS I F 


9280 


23181 


A 


9349 


53 


191 


I YVTY I WYS E VSHAYNS STLGGQGGR I T 
GAQEFKTSLVNIAKPLLY 


9281 


23182 


A 


9350 


373 


235 


EEGELLEPGVQGCSELSWCHCTPAWGTE 
QDHCVSYlSnSFSNTHNNVNY 


9282 


23183 


A 


9351 


476 


82 


YLAT YLPTYLS I YLPTYLP IYLPI YLYL 
G S H Y I AQAGLKFLG WVGHNG S C L 


9283 


23184 


A 


9352 


286 


142 


ELQFKMRFGWRHSQTISWGLSLLPRLIS 
NSWAQVILLTWPPKVLGIQA 


9284 


23185 


A 


9353 


3 


102 


FFETESHSVARLEYSGAISAHCTPVIKL 
YLYT 


9285 


23186 


A 


9354 


3 


270 


TAGAPPRPANFCI FSRDGVSPCWPGWSR 
TPGLRCD P PAS AS QLLGRLRHENCLNLG 
(jiiljC 1 (jAi JLiAilu 1 o I Jj KACJA1 XjQ PQPb C 
PAPQ 


9286 


23187 


A 


9355 


1 


112 


GEGSAHCNLCLPGSSNSPASTSQVAGIK 
GVSHRARPN 


9287 


23188 


A 


9356 


1 


63 


VALIQAYVFTLLVSIjYLHDNP 


9288 


23189 


A 


9357 


244 


140 


QRRHTHTHSHTYTHTHTHIPNKKIPILT 
ILKCIA 


9289 


23190 


A 


9358 


464 


181 


LGAGGMTGILFSLGASMVLGGVAQMLAP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, FHPhenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Va!ine, 
W=Tryptophan, Y= Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KARTRR I QTTDNGKQNTY FS SLDNMVAQ 
GNVLP VI> YGEMRVGSRWSQE I S TADEG 
DGGQVWIGR 


9290 


23191 


A 


9359 


11 


469 


IDLRCRFRHEERERERERERERERERER 
EREREREREREREREGRAPPPPIiFFGGG 
GPPARAHALSPPHLGRPPLYTYFFSYVR 
AP PTERSARTQREARRALKS VC WRERE 
THSLPRARCVCSHPTGCGGAPGARVSLS 
QSAHCFLCVCGAG 


9291 


23192 


A 


9360 


22 


172 


GLSHEERERERERERERERERERERDTV 
CPLFCPTGGFILREARVKSGCLCF 


9292 


23193 


A 


9361 


15 


345 


P PNRP PAE FRTRHL I S S LL CLEG 1 1 LS L 
FX IATLITLNTHSLLANIVPIAILVFAA 
CE AAVGLALLVS I S NT YGLD YVHNQKKK 
KKTWGGGRAPNS P YGGRNNNPRPAV 


9293 


23194 


A 


9362 


61 


298 


SCDSPLQECC IHTHTHTHTHTHTHTHTH 
LFSDIKTHTWREWALSLFLLTEEEGGGT 
LFQGTLI FGHTLFWDKPLWQKEF 


9294 


23195 


A 


9363 


430 


2 


GGAGPPPPCWTQGPGVPPPTLSPRVNGP 
S LCSD FP S P KFYKE KRKKKGNKTP KR.KG 
KTLLPTETIVIiHVSPGCESKPGAGSADR 
RAPGVRDGGSDAS S S GTRAS ELGDGPVR 
VNFVLGDPGEAGRGFSMLGHPPGTSGPR 
TS 


9295 


23196 


A 


9364 


432 


291 


GITGLRHHSQLIFLFLVETGFHHVGQAG 
LELLIHQHRPSKVLRLQV 


9296 


23197 


A 


9365 


343 


507 ■ 


GQGLTLVPQVGVQWGNLSSLQLLPPGLK 
QSTCLSLPKCWDYKCGLPPPASFCFVL 


9297 


23198 


A 


9366 


417 


128 


MG W WPGRGAP PFLDEMAAGQRRS S LSRV 
GSQAEGLLTSHTMGGPAETLLTSQTGWR 
LGRGCNLG S L EGQ GRRLGGGG CNE P S S R 
HCTPAWAPLSTE 


9298 


23199 


A 


9367 


500 


285 


SLPRSWDYRRAPPRPANFLFLVEMGFHH 
VGKAVLEIjIjTS gvellts GDLPASASQS 
AGITGVSHRAWQIRE 


9299 


23200 


A 


9368 


1 


229 


IVDLTTWGIPTHTIHKYTHHIHKYIHTH 
AYTHHTHRHKHIYTYIHTDMVWLSPHPN 
LILNFSSHNPHVSWEGPNGR 


9300 


23201 


A 


9369 


451 


88 


KQTNIYTCIYMYVYIYTCIYTYVYIYTR 
TYTCIYVCVYTCIYVCVYTCIYVCVYTC 
IYVCVYTCIYVCVYIRVYTCVCIYVYIR 
VCVYTCIYVYVYIRVYTYICIYTCISTY 
IQCIYTHIYKVTKCSG 


9301 


23202 


A 


9370 


1 


208 


ADETIEQRGGI PKFVHDNYVIRRTAVDR 
GIPLLTNFQETKLFAEAVQKSRKVDSKS 
LFHYRQYSAGKAA 


9302 


23203 


A 


9371 


354 


202 


PAAQREG I TASARYS KNWVTPGDPAREF 
TMI QSAPLMLLAD PDE FVS VQLA 


9303 


23204 


A 


9372 


1 


336 


KSLIKYKGYQVAPAELESILLQHPNIFD 
AGVAGL PDDDAGEL P AAVWLEHGKTMT 
E KE IVDYVAS QVTTAKKLRGGWFVDE V 
P KGLTGKLDARKIRE I L, I KAKKGGKS KL 


9304 


23205 


A 


9373 


2 


187 


NILVTALYSLYIFTTTQWGSLTHHINNI 
KPS FTRENTLMFIHLS P ILLLSLNPD 1 1 
TGFSS 


9305 


23206 


A 


9374 


96 


240 


ACWLMPVI PALSEAE AGRLVQPRRLRAA 
WAT WQDL I S TKKKKKGGP F 


9306 


23207 


A 


9375 


3 


293 


STISAFTTSTTSATTTSTTSATTTSTIS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
Zo 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L— Leucine, M=Methionine, 
N=Asparagine, P~Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i — mreomne, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *— Stop codon, /^possible 
nucleotide deletion, V=possible 
11 uLieuuuti nisei tion 














APTTSTTLSPTTSTTSTTITSTTSAPIS 
SNKLHKKKKKKKKQKKKKKRKNDDTKQD 
T T TK T Vf acta <3 T, 

ilX XvxixOCrvjljrO J-l 


9307 


23208 


A 


9376 


69 


160 


EHTHTHTHTHSHTHTHSHLPEEFFCVLN 
SL 


9308 


23209 


A 


9377 


336 


199 


EGITASARYSKNOTTTGDPAREFPMIQS 
APLMLLADPDEFVSVQLA 




ZdZWj 


a. 


93 /& 


3zl 


1 ""> Q 

lzi> 


DHRRE P PHPAYHS L YESGP S VQNGEE KG 
INPIIKMG 


oi i n 
yo 1U 


Z5Z\ I 


7\ 
/\ 


93 /9 


440 


ZOO 


EGECLSYLVFLVLV 


0"2 1 1 

1 1 


Z5Z\Z 


/A 


fil OA 

93 60 


TAT 

392 


\ S A 

154 


FFFNRDRVSPCLPRLVSNSWPQAILPPQ 


9312 


23213 


A 


9381 


403 


178 


GRRHE P CRDADAHL E G 1 1 F S AF Y P KNWE 
TTGDGAREWTMSNSAPLMLLADPDEFVS 
VQLA 


yj 13 


23214 


A 


9382 


303 


206 


DPVSiKICibljQPPPPGFKRFSGIjSLjPSS 
WDHR 


y$ 14 


z52I5 






215 


116 


SCGTM 


9315 


23216 


A 


9384 


2 


285 


ARHLLMQLSGGATLAISTIDLPSTLIIF 

LGDNTYKKKKRGAAFEETKFTSAGMQS K 
NFFLGTLNWR 




Zozl / 




y3oD 


1Z 


1 T3 

1Z3 


PVCWTJT T "DTT U r Pr\ r PU A "P" r PT-JT l tJ r PXJ r rLI , T l tJ T 

erX rljlNJfXjljir J. Jurl 1 y X JrLi\rilrl 1±1 IxilrlliiJL 

HSKDPMEY 


93 1 / 


z3zl o 


7\ 


93 So 


3 


23 U 


TT/^TaTOVD /~*T?TiT ID T 7\ TIT TJ r i X?'C"T'Tir^r^ T C5T <Ti C 

J^CWUxK^JCjJc , ijKJ_iAi J ijKL.r r 1 1 Pbijlob JLo 
GPALLSNASALAEVRGQQLCPPTPFLPT 
LIALTTLIiLPISPFILIIL 


yj lo 




jB. 






39o 


SWDYKCAPPCRADFCIFSRDGVLSCWPG 


7J 17 


ZjZZU 




7JOO 


994 
ZZH- 


fy 


PANTVKPRFL.KRKKKGKKKG 






A 


^/ J 07 


94Q 


79 

/z 


Ij WT i T T VKTC T T .F PVR T T? TFNK" T T)TTM T f^TT 

-1—1 v X J— 1 J — L V J.\.i>_ J_ J_li XT V XI X i\. ± X. i.<J XV X Jil'I X V3XI 

KTPTJSIKIiOMTE XT iO S VFSDYNR T KT »R A 


9321 


23222 


A 


9390 


400 


254 


LRQPGS TNS P VSAS RVAGITGTHHHTQ I 
IFVFLVEMWHACNPSYSGG 


9322 


23223 


A 


9391 


3 


64 


PPQKSDPPYKLVPVHLDS IL 


9323 


23224 


A 


9392 


266 


106 


LiTANFLFIaVETGFPHVTRVGLELIiASSD 
LPTLASQRAGITGWHLQNFWPGNF 


9324 




A 


7J7J 


din 




■pa ypn p p p irn JT.Q p p p p PP PO TT A P K a 


9325 


23226 


A 


9394 


481 


311 


LSLSEKRS IRVGVTRFSRCCPSPLSLTG 
KGNSLTPGVSQEIHSPPWFGIAYSVCTH 


yoZO 


ZoZZ / 




y3yj 


3 


13o 


\TT "MCTaTS niWTT .T OIaT D TJ TTT TT r\7\ r" 1 
V r iriXXj VXjl^toWi^yi v lXXJijL.VVlr'iriX 

TTRPSHFLPLLNSKYK 


9^97 
yjz. i 


9199R 


A, 


9^Q6 

7J70 






NFVFLVE TRFHHVGOAGLTLiLTS SDPA^ 

x» x v x j-j v J-i x xs.x xxxi v \j\£nsj±J x J jxj j. i— ' ox/ - rio 

QSAGITRVRPEILTLKRYHFPSGILGTE 
GMVIEMAYLLKV 


9328 


23229 


A 


9397 


2 


231 


LELiLiTSSDLiPASASQSAGITRGSHHARP 
SFTFRCVIHLEFNFMRGAREERLHFYCR 
WVIRRLTT FGRGYPS PAASD 


9329 


23230 


A 


9398 


383 


200 


QHYLNRRLSLAQWWMPWPATQEAEAGG 
LLEPRSSSLQC^MLPVISTSHCTLAWATH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last Amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K==Lysine, L— Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














REPHL 


9330 


23231 


A 


9399 


3 


416 


GFLHVGQAGLKLLTS GDLPTSAS QSAGI 
IGVSHRAGCLKS I YLTFPTRKFFVQMS I 
CDFKVEIEGKIGFISVPISLQSILHIVN 
NFFFFLETGSFPPGWRAWEKFGLIAPLA 
SGVKGIFLPLPRENLGWRPSPIPPA 


9331 


23232 


A 


9400 


400 


151 


FGYPQVFPLPPFLKPGPGIFFLGPLKKN 
FPFPPPGLKIFSFKRAPPFFFFVRKGLI 
MLLRLVSNSWAQVI I PPPPPEVLGLQA 


9332 


23233 


A 


9401 


324 


154 


FFFFFFKXXXXXLLFFFFFFFLFYIFFF 
FFFXFXFXFFFFFFFFFFFFFFFFFFFF 
F 


9333 


23234 


A 


9402 


413 


312 


PKLVLWSWLQALLLPRPPKVLGIiQARAT ' 
ITQPK 


9334 


23235 


A 


9403 


363 


3 


FKKKNFCLGGAGWFFFFYDPPPKKNIF 
YFPSKKIYFLSPASSSFFFFLCFPFFIF 
FFFFFFFFFFLENRWFLMENFIKGRGHV 
KGPLGQARDHD I SGRRWWRREQCRASQL 
PGVRPRV 


9335 


23236 


A 


9404 


401 


82 


SELPHDP INPTTGYI YPKKMKSVCFTD I 
GTPMFTTAIiFSIAKIWNQPKCLSKRITI 
FMFLVTYSQLFCTHYPILPYPFFPAPLS 
SSHNYHYTLYLYENILLDSTYE' 


9336 


23237 


A 


9405 


382 


219 


WAR WHG F L VP T S PGGEAGGS LE PGKRRV 
QLTQISPLPSSLGGKVGLPLKKKKNN 


9337 


23238 


A 


9406 


3 


195 


YL YS L VE TGFHHVG QAGLE L LIS RD P S A 
LASESAEITGMSHSIQPKSAFYIQIPET 
FWQPAFK 


9338 


23239 


A 


9407 


371 


1 


FRRLRL KNRLNLGNGGCS E P RL CHC I PA 
WETRAKLGRKKKEERKREKGTKMYNKMY 
KM YLE LNLTC P E PAS P KQMTRQLE VLS S 
NP S P GT FQ F I C KKQ FKFS C C FAIiR V S I Y 
LLMSVFEYCSC 


9339 


23240 


A 


9408 


326 


229 


RQGLTMLPRLVSNSWAQVILPPWPPKVL 
GLQA 


9340 


23241 


A 


9409 


81 


231 


LESAS FPPGKVAHTCNPNTLRGCAGWIA 

TaT R OTT T? TTT* G T r^ATTWlTk Tfor^T V7TTT 


9341 


23242 


A 


9410 


367 


256 


HPGWEWWLVSVI PTLREAKVGRSLEPRS 

JjKiJiiWL' 1 Wvr 


9342 


23243 


A 


9411 


2 


192 


VFLVEMGFLHVGQAGIiELPI SGDPPASQ 

q a n t Ta\7 q "ptj a r\ n t .t* T?T .TJ T "D T "NT l> q hd td tt 
orib J. JL V o rLrlft.{ x )£±±-i 1 r J-JlN ± ir J..DJ Jr orlF Jr JX 

ACCSKQL 










"3 
O 


Q1 
OD 




9344 


23245 


A 


9413 


371 


301 


E DH P RM VI LRLLR Y I VRLVWRMH 


9345 


23246 


A 


9414 


486 


337 


AGLELLGSSNLPALTSHIAEITGMCHHT 
WLjAYI F V C^FGWVIjIjIjDLCIjAF 


9346 


23247 


A 


9415 


378 


254 


IVAHCSLDLLGSSDPLTSASRVAGPTGM 

/-1TTTTT 7T.TT TTTT 7TT , TT 1 T 7f~* 

CHHVwLIVVFFVC 


9347 


23248 


A 


9416 


160 


309 


WRYSSNEFLLFLLFYLFKSNFTVFCCC 


9348 


23249 


A 


9417 


162 


339 


RPSSFCEFLHDSVPELALPMKARFEKSR 
RSQVSWLMPVIPTFWEAKVGRYQARSLI 
PA 


9349 


23250 


A 


9418 


1 


147 


GGGGCSEPRSCHCTPAWATRVKLGLFKK 
KKKINFLF VPP F I F LKS KTGV 


9350 


23251 


A 


9419 


283 


126 


KMMYKGQAQWL I P I I P AFWE AKERGLLE 
VRSLRPAWATEQDS TATKNKNKNTF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R==Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


9351 


23252 


A 


9420 


133 


3 


ELMFS WAWWIMPVI PALWEAEAGGSRGQ 
EFKTSLPNMVKPRLN 


9352 


23253 


A 


9421 


3 


133 


LRGLYSGAGI IRFEAKKWNALALWAWDI 
AGDNCAI CRNH IMDL 


9353 


23254 


A 


9422 


409 


291 


ILILGGGSCSELRSCHCTPAWATRAKHR 
LKNKSPINVY 


9354 


23255 


A 


9423 


1 


210 


RF FHLGL S KHWD YRCE S PHAGTRNE VL I 
YATIWMNSENIVPNEIIQTQRTNIVSFY 
LYQVGRIGKFIDRR 


9355 


23256 


A 


9424 


479 


306 


GVPTNQPWSNRVKTPSPTKNIKISWPWW 
CMPCNPSCSEGQGRRIALNPGRWRLQ 


9356 


23257 


A 


9425 


1 


209 


GFFLFQGGGHHAGQAGLELLTSGDLPAS 
AS Q S AG I TGAG PNQ G I F VALE S PME S AA 
LLHLHWLSSPCS 


9357 


23258 


A 


9426 


628 


433 


RQGVAHLPGMECSGTITTHCSLELLGSS 
DPPASASQSSRLAWSHCAQPSIilaFSVL 
CLKPIFCCA 


9358 


23259 


A 


9427 


417 


125 


FLRQSLTSSPRLEYNGAIPAHCNLCLPG 
SSDS S ILASQS IRI TGVNHCTWPDAAVL 
RQNFFFSEKPQFYSEGFQQTHI XKSNLL 
YLKSTKCTCYPYL 


9359 


23260 


A 


9428 


366 


256 


SF F YRDRS LATLPRLGSNAWS QAILP PG 
PPKVLRLLA 


9360 


23261 


A 


9429 


3 


140 


GFCHVGQ AAL KLL I S S DL P T S P S QS AG I 
TGVSHCARLTRLFS SHE 


9361 


23262 


A 


9430 


368 


3 


VRASQIiLGIiPTQASAMAGTPPPASLPPC 
SSISDCCASSEQGSMGVGPSEPGAGYNL 
WCSSLRTDEKCS IRVGVPQFSRCHLSQ 
LPLARKGNS PT P CAS WVRRCLTLLQLML 
RGLYPLSCT 


9362 


23263 


A 


9431 


3 


131 


LTMLPRL VSNS WAQGS SHL S PP KCWD YR 
CEPLHSGLKDIFLE 


9363 


23264 


A 


9432 


384 


274 


LSFAMLPRLVLNSWAQESAHLGLPKCWD 
YRHEPLGLA 


9364 


23265 


A 


9433 


431 


273 


PPPLGYFFIVKMEFPPWQDGFKLLNSS 
DCSTSASPSGRISGVNHSTRPRIF 


9365 


23266 


A 


9434 


211 


473 


CVCTFFSFLSMHI FFLFLVGEAESHCVT 
EAGVQWHDLGSWQPPPPRLKRFSCLSLP 
SSWDNRRVPPYPANFCIFSRTRVWSCWP 
GWS 


9366 


23267 


A 


9435 


391 


188 


L PGS S DS PAS AS RVAGTTGGCRHTVLLG 
FE FKDNI RWI KLVE KAHTQ YGNI Y I CVI 
KYMPLFYPYHH 


9367 


23268 


A 


9436 


331 


115 


SRDRGFAMLPRLVLNSWTQAICSLLDSQ 
SAHLS LPKCWNYRRE PLCPASGE ILSFI 
KP S S S KS FHN"S TKKRQ 


9368 


23269 


A 


9437 


3 


109 


IYGSYSLPACFFPSFLPSFLPSFLPSFP 
PFFLSFF 


9369 


23270 


A 


9438 


2 


167 


E CNGA I S GHCNL CL S GS S DS PAPAWATR 
VKLiRL I KKKKKKKNQ I LFFYP PRKTLF 


9370 


23271 


A 


9439 


648 


310 


VCPCILRHPAFLFAKSGNPLESSLLDHS 
SPTFKLTDRENLRHVKCTSHSQSLPTSG 
FQPGLPDS TL CAWYPTRS PKLS PAGWVG 
GYRAP VGRRGPGS RNS WRLHWT P I TS VG 
D 


9371 


23272 


A 


9440 


1 


170 


YEIWFLDVGQAGLELLTSGDLSASGSQS 
AGTTGMSHHARPLKLFTI S VS SNKKVKC 


9372 


23273 


A 


9441 


133 


244 


LFGYQTRGLAMLLKLDSMSWPQAVLPPQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown ? *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














PPKVLGLiLA 


9373 


23274 


A 


9442 


365 


176 


RGS MDMVPSE S CTG YNLLMC PLLRPLE K 
RSIRVGVTRFSRCHLSQLPLARKGNSLT 
PCASQVR 


9374 


23275 


A 


9443 


368 


231 


VEAGFCHIAQAGLTFLSSTDPSALASRS 
ARITGVSHHARPI FLKE 


9375 


23276 


A 


9444 


88 


194 


KMLEAKKKIEQQGGFTFENKGVLSAFNF 
GTVPSNN 


9376 


23277 


A 


9445 


3 


349 


IRSGSYACYFCYPRECYALFHCYSMIVS 
YGP YYFMNIDQRHE SRLSTI YI FKFE IG 
S KHVAQ AGLE LLC S S S P PAF TS Q SAG I T 
SVSHHTRPQKIFNVIYSCDLHFHPGIEG 
PQR 


9377 


23278 


A 


9446 


3 


141 


HAECSSMMIAHCELELLGSTDPPASASQ 
SAETTGISYHAQPYSTF 


9378 


23279 


A 


9447 


2 


282 


ICPTRSRFPPRVLKNGRKKDRLECSGAI 
SANCSLCLPGSSDSPASASRVAGSTVAE 
EDS TRRAVRHQGS SRSAPSGRGS WAGKQ 
ERADRAQQS 


9379 


23280 


A 


9448 


307 


430 


GLAWWLTPGIPILWEAEVGGSLEVEFET 
SLASMEKPSLYYK 


9380 


23281 


A 


9449 


2 


253 


IQCVCVCVCVCVCVCLSVCLHFIAILGC 
SPWGSRLPCSVSSPSGRGPGGEDLSPLP 
PTMS VS PEADFP PLAAAAPGN I MTAI T 


9381 


23282 


A 


9450 


1 


166 


NTGGGGEEIRVFSSCFAEDRLQKGKDRP 
GAVAHACNRS TLGGRGGW I TRS QIFFC 


9382 


23283 


A 


9451 


1 


123 


NTRGLVMLPRLVSNSWAQVILQPLPPKV 
LGLQASVTVHINN 


9383 


23284 


A 


9452 


396 


291 


S LC I KRVCACAC VC VC VC VS GAKLRKLV 
LDFMRS 


9384 


23285 


A 


9453 


1 


147 


FAIRARFSNTYTKIGTIHTYIYIYTYIY 
TYIYTHTHIYILSQVLMEYEF 


9385 


23286 


A 


9454 


394 


235 


TRSCYIAQSWQWLFTGTLI IHYNLQLL 
SSRDTPVSASQYLGLQAHTTTPDSC 


9386 


23287 


A 


9455 


1 


85 


TNPVATLLEWQDALCFI PVGSAAPQGS P 


9387 


23288 


A 


9456 


339 


165 


KTHWWGKKS FLKNPFFFFFFFETKSRSV 
TRLECSGVILAPKQKTKKQNDKTTINKT 
X 


9388 


23289 


A 


9457 


3 


209 


LLILFSIKMRNRPDSPALLSTQLPPIFI 
YIHTHIYLYIYTHIYIFIYIYLYIYIjFA 
YIHIYVSLFPLK 


9389 


23290 


A 


9458 


374 


156 


QAGLELLTSGDPPISASQSARITGLSQR 
AQLSELTLTVFTTRHCTCLCAHFLYYTV 
FLEGRNDDE F I S I P S T 


9390 


23291 


A 


9459 


287 


3 


EAGLLWRKATHKYAHTHTHTHTHTHTHI 

RGLQNM INCKYITYQE IMTLAQAKNKGV 

I PDSFPFTS S PPLT YPPALS FHFQKHI P 
NKTTTYHGHC 


9391 


23292 


A 


9460 


3 


72 


KTELKMKFTN I LESS FLMNNKS 


9392 


23293 


A 


9461 


327 


92 


ISLFLVWGALFFPKKIIYPPKKKSPLFF 
PPPPPQENFNPPSFLAPPVFPPQEMPKF 
AGGRKI FCPPPPVFFFYFFFFF 


9393 


23294 


A 


9462 


3 


173 


ACRGDSGGPLVTRDLKDTWYLIGIVSWG 
DNCGQKDKPGVYTQVTYYRNWIASKTGI 


9394 


23295 


A 


9463 


258 


39 


LKVILPSDPS S I FVCLFLENESRSVTQA 
GVQWHDLGFKRFSCLSLLSSWDYRRVPP 
P PAKF C I FRKHGVS P C 
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SEQ H> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine y 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


9395 


23296 


A 


9464 


104 


336 


NHVSQCEE I FLFDE Y YI FS RAVASMFW V 
DAELGSDIYLDGKLKNSFLLSFLGFFFF 
FFLGKKVFFFSQNGGLGRPGQ 


9396 


23297 


A 


9465 


410 


0 


LNLGGGGC S E PRL CP CTPAWATE 


9397 


23298 


A 


9466 


254 


61 


KKKKDAFFTIPLYPSSQNLFAFTQMDPD 
TN I S QQLTWTVLP QGFRHS P PTFLDKP F 
KQTS INF A 


9398 


23299 


A 


9467 


403 


255 


PAPQLGGP PMMPMMAP PS S WMMPMGCAP 
AMRLPIGGDIIWLCVPTQISP 


9399 


23300 


A 


9468 


1 


1428 


PARFKDLNIiAGTAEVGLAGYFMDHTVAF 
RDLPWMVCSSTCYRAETNTGQEPRGLY 
RVHHFTKVEMFGVTGPGLEQS SQLLEEF 
LSLQMEILTELGLHFRVLDMPTQELGLP 
AYRKFDIEAWMPGRGRFGEVTSASNCTD 
FQSRRLHIMFQTEAGELQFAHTVNATAC 
AVPRLLIALLESNQQKDGSVLVPPALQS 
YLGTDRITAPTHVPLQYIGPNQPRKPGL 
PGQPAGPRVGYKALQPWPAQRLGPVGRP 
PCRPEGQPLSCCCCGCSCWGRLTRCLAE 
WTEVLTVGQVQRGPCTALLHKELCGTE 
PHGCAS TEE KGLLLGD FKKQE AGKMRS S 
QEVRDEEEEEVAERTHKSEVQEQAIRMQ 
GHRQLHQEEDEEEEKEERKRGPMETFED 
LWQ RHL ENGGDL Q KRVAE KAS D KETAQ F 
QAEE KGVRVLGGDRSLWQGAERGGGERR 
EDL PHHHHHHHQ P E AE P RQE KE E AS E RE 


9400 


23301 


A 


9469 


265 


170 


GTE SHS VTQAGGQWLNLS S LQ PLPPGFK 
QFS 


9401 


23302 


A 


9470 


289 


110 


RDLELASFLSFFPSFLPSFFPSFLPSFL 
PSFLSFSFLISFFPLPSFFSFISFSLSL 
FFE 


9402 


23303 


A 


9471 


67 


210 


TVTTGQALTHAYNPSTLGGRGGQITRGQ 
D FE T TLVP GD LNF S RVLAS S 


9403 


23304 


A 


9472 


2 


186 


TS L WRQ AKKNVE KKTDKYTE VL KTHGL L 
VCTQKS CS FLKNKNKAWLGKQANAPARK 
VEIRC 


9404 


23305 


A 


9473 


417 


265 


FLSSFLFFFFFFFFFFFFFFFFFTV 


9405 


23306 


A 


9474 


430 


120 


ASQKKKKKNKKKKKKKKKKKKKGGAL 


9406 


23307 


A 


9475 


1 


150 


PTRPWQAGFKVLSSGNPPALVSQCFRI 
TGVSHRARPHLLNTjRAPNLPVIi 


9407 


23308 


A 


9476 


404 


109 


PPPLFFNFGPFFLVEARLHHVGQAGLKL, 
PTTSDQPDSASQSAGFTGVSTRAQPTLI 
TLY 


9408 


23309 


A 


9477 


391 


235 


P PFFFFCRDRGLTML PRL I SNS WAQVI L 
LPWSP KMLGLQARAT VSGLNKRYS 


9409 


23310 


A 


9478 


438 


277 


KRAP FFFFCRDRGIiTMLPRL I SNS WAQ V 
I LLP WS P KMLGLQARATVSGLNNRYS 


9410 


23311 


A 


9479 


399 


279 


PPPPVFKNPPPKKNKGAPKKKNISPPPG 
RKKYFFFKGPP 


9411 


23312 


A 


9480 


247 


26 


IiKAGDDSRTVLGKVHQSPCFSLLLFFI Q 
VSCRDEGLTVLPRLVLNSRPQAILiIjPQL 
PKLLDHRCEPPHLANHG 


9412 


23313 


A 


9481 


2 


176 


HAPPHPANFLFFEEKESCYVTQVSIKLL 
DS SNP PALTSQSAG I TGENQHGHPMNCL 
SG 


9413 


23314 


A 


9482 


2 


178 


HAPPHPANFLFFVEKESCYVTQVS IKLL 
DSSNPPALTSQSAGITGVSHHGQPMNCL 
SG 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIaIanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=0Lysine, L=Eeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9414 


23315 


A 


9483 


2 


151 


GRVGEGPELVSLLVLDQHPVGTSDHRDA 
TQLGFQPLMSDLGSTGSTEGA 


9415 


23316 


A 


9484 


420 


325 


FFVFLLETGFHHVSQDGLDLVICPPRPP 
NTFV 


9416 


23317 


A 


9485 


400 


104 


LFFFLFFFFFFFFFFFFWNPPKTFFKKV 
FGLNFPTLLKLKFNPPPPGPPKKKKKEL 


9417 


23318 


A 


9486 


117 


280 


S S TNQANN CF F F FKKRGLA I L P RGALNP 
WAKGILLPWPPKGLKLRGGAIGPGQK 


9418 


23319 


A 


9487 


76 


353 


VLWNTLE FCL I YLDLLFLIiFFNKEAL I R 
KKKKKKKKKKKRGGNKKE KKKKTPWQE K 
AGLFLGDT I I YSRGAVLKNSGGETPRHT 
PSGGLWPR 


9419 


23320 


A 


9488 


3 


122 


HASEDNVFLHCSHSDLGYVLLoAGTASLQ 
DFSFFQKSPPP 


9420 


23321 


A 


9489 


1 


155 


PTRPTVCSGAMIiAHCNLCLPGS SESLAS 
ASQVAGTTACTSTPDPFFFFFSL 


9421 


23322 


A 


9490 


3 


212 


ETGFHHVGLSGFKIiLTSGDLPTSASQSA 
GITGMCHCVWPQFAYFCLLLVYLDHLKF 
TIIMLVLCLLFYD 


9422 


23323 


A 


9491 


2 


163 


E TLFRHVGQVS FE PLTSGDLPAS ASQS A 
GITGVSAMPS WMI KFFI AFYS C INC 


9423 


23324 


A 


9492 


474 


296 


S QPAGTAD CLL PL P PGS CF PT I P CPAAL 
VPLPSFPSARLWPPQSLHQLLPLPGTCS 
SCT 


9424 


23325 


A 


9493 


7 


151 


L CAVE FVYMKP P AHEEAKAP S RGFWRY 
APWTASSSEKGRDDGMGPRA 


9425 


23326 


A 


9494 


52 


363 


TTAPSLSVFLFTFTGELFVYWWVGVTLE 
TSFISFFLFFFSFLEKDFSFSPAGRPWP 
NFGL I QPLASG VGG I S RPNP PS RWELRA 
LPPPPGGLGFFFWKKGGPPG 


9426 


23327 


A 


9495 


383 


118 


WKPRPWVPPGAGVQTPPGPQGETPLFLK 
KKKKRGRGGPLFGPPSFGGGGGEIPLTP 
APKGQEIYFPPPPPPPGGKKKSLFQKKK 
KRIR 


9427 


23328 


A 


9496 


554 


419 


FFFETKPCSVSQAGVLWRDFGSLQPPPS 
QVQRDS P CPQAS QVAWD 


9428 


23329 


A 


9497 


428 


281 


GSYVQLTFKYFS I YIYVHTHLHVYLYTH 
IYVYVHICIYKEGYITGSVLA 


9429 


23330 


A 


9498 


144 


16 


KVKKKVS QCPG I VAHNCNP S TLGGRGGR 
IAWGQEFETLSLQKN 


9430 


23331 


A 


9499 


2 


155 


NLALLGSSDPPVSASPVSGIRGVHHHAQ 
LIFFFFFGKRAWFNPPGGRKGEE 


9431 


23332 


A 


9500 


2 


184 


F I YGRD S L TML PRL VLNS W P QA I L P PR P 
PJCVLGFRCEPPHLASL.ILFYRDILFTLG 
LKSV 


9432 


23333 


A 


9501 


465 


153 


PHKVFLNKGPFETPQGELSRGEWKTRPP 
PPRERSRPFHRKGGFGSPPKLRQGPPFV 
FFSWPPWCPPPKKGARPQQKGFFYFPPP 
FFNPPFFFFFCLFF I FE IV 


9433 


23334 


A 


9502 


429 


11 


CRTLLGADCVCVC VC VCVC VCVKAFPP I 
SYSCHLVPSDLPLLGVQFKCWVHFGLSF 
LTCPFPSPPWPGGAQPGSCYWGLGCLCL 
FACKLGARLTEQTLGI PASGPGTWATPL 
GS CAS VLRRLPGGS ERDTDLKVS VLSG 


9434 


23335 


A 


9503 


436 


202 


LPGGYI FQGEGVEKTKFSPLLRENNFFS 
NFFPPKKNNGGGPRIPPPPPPPLFKKGV 
F F I P KF PAP PR P KKKF PKKKKK 


9435 


23336 


A 


9504 


3 


112 


S FDLLS SSDPPALASQSAGI TGVSHSHG 
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SEQ ir> 

NO: of 

nucleotide 

sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQH> 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QKVYICNM 


9436 


23337 


A 


9505 


426 


208 


VGHNLLVPRFLSPSEKRS IQVGVTRSSR 
CRLSPLSLTRKGNSLTPFASRVRQCLTL 
LRLARAPTDLRPLSGTP 


9437 


23338 


A 


9506 


441 


189 


CPANFFVFLREMGFHHVGQAGLLTSSDP 
PASASQSAGITGLSHCARPTVGSDVLSL 
PSGQCTSALTGFMLGSTTVHSNAAARA 


9438 


23339 


A 


9507 


117 


278 


NKALLCHPGWmiKKVNGRLGMETHTCN 
PTTLGGQGQEFENSLGNIGRPHLYK 


9439 


23340 


A 


9508 


3 


126 


FFVEMGFRCVGRAGLRLLTSGGLPVLAS 
QGAGITGVSHCI 


9440 


23341 


A 


9509 


421 


134 


FLVERRSHHAAQASLEILSSSDSPTSAS 
QCAGITGMSHCARPKYTSFFRLHRHMPM 
ATPTKLPPTLASAS PQVKVCLYPLLPST 
S KGE I FTLQLY 


9441 


23342 


A 


9510 


2 


200 


PQPPTFYPSFLPSTPEPPAFYPRLPPST 
PASRLLPQPPAFYPSACHLLPQPPTFYP 
SLGNRARLHL 


9442 


23343 


A 


9511 


414 


204 


ERDRKKERKKEGRKERERERKKEKKRKK 
KKERRKEGRKEGRKEASKGQRWVIMVD 
T 


9443 


23344 


A 


9512 


1 


211 


FHHVG WAGLGLLT S GDL PAS AS QS AR I T 
GMSHHARHRHGFDNLSLCALAPSPTPLW 
RRHRRRGH FT VP E K 


9444 


23345 


A 


9513 


390 


265 


VFLVEMGFHHVGQAGLELLTSGDLPPLE 
SQSAGITGVSQQFS 


9445 


23346 


A 


9514 


438 


180 


I G FRHVGHAGLD FL P LGD L P S LAS QNAR 
I TGVSHRAWPAMAFVNCHGAGGS VAMRT 
TRGHS CRHLG I GGFWPAS LLQP VLSAGS 
L 


9446 


23347 


A 


9515 


126 


11 


RNLDFFLSLGLAMLSRLVNSWPQVIFQA 
WPPKALRFQV 


9447 


23348 


A 


9516 


2 


249 


FLHVGQAGFVLPTSDDFRPTHTPQSSGI 
TGVSHRTRPLRCYFLRLNNI PLYVYAYY 
ILFIHSVISGYVGGCTAHYCHPRKW 


9448 


23349 


A 


9517 


403 


57 


PEGALVPQKKKKKKKKKKKKKKKKKKKK 
NIKI 


9449 


23350 


A 


9518 


406 


140 


PHYNHTFVYNGVRLEDLQHMCLELTVWD 
REPLASNDFLVGVRLGVGTGE IPSPILT 
CLKAS CLFYSYYHSRPPTQSSS VMTWF 
FPFS 


9450 


23351 


A 


9519 


3 


75 


G I EWMVNC WRNVHRPPRQRDI T 


9451 


23352 


A 


9520 


183 


6 


LLTHL I LTSAL I GE I MVAP I LQKNSGS E 
RE CRL PRL VLNS W P QGI LL P W P P KVLGS 
QV 


9452 


23353 


A 


9521 


89 


339 


LQCLQKSGKLEPGMTYTKLIDADVNVGN 
I TS VQ F I WKKHLFEDS QNKLGAE MVINT 
S GKYG YKS T F CS QD I MGPN I L QNL KP C 


9453 


23354 


A 


9522 


3 


329 


PSLMFPWLKLSFPGRPRGRQTYSRYQTL 
ELKKE FLFNPYWTRKRRME VFHALGLTE 
RQGKIWFQNRRMKGKKENTRDNLPEAQN 
ESKVEEKGIEEKEKEEEEKEENRD 


9454 


23355 


A 


9523 


1 


156 


AKTSMNVNE I FMAI AKKLP KNE P QNPGA 
NSARGRGVDLTEPTQPTRNQCCSN 


9455 


23356 


A 


9524 


1 


272 


AKNSARALLGILGYRSHLISSLLCLEGI 
I LSL VI I ATIilTLNTHS LLAN I GP I AIL 
VFAACEAAVGLALLVS ISNTYGLDYVHN 
LNLLQC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G^GIycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine ? P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9456 


23357 


A 


9525 


1 


,62 


GNDAWVCTNDMFRLALHNFG 


9457 


23358 


A 


9526 


1 


111 


GDI IHNENMRQGRDKYKTLRQ IRQGNTK 
QRIDEFEAL 


9458 


23359 


A 


9527 


427 


340 


I P I GN IMS CD S HSHHE L VMS AQLNA I FH 
D 


9459 


23360 


A 


9528 


439 


27 


PRRTDKWLTVEM I QDGHQ VSLLSGELTV 
DQRAS 1 1 QRFRDGKEKVLI PTNVCARGI 
DVKQVTIWTFDLPVKQGEEPDYETYLR 
RIGRTGRFGKKGLAFNMIEVDELPSPMK 
I QALFDGRNKQLNAEVP CI QAAMED 


9460 


23361 


A 


9529 


1 


236 


RE VL I TTNVCARG I DVKQVT I WNFDLP 
VKQGEEPDYETYLHSIGRTGRFGKKGLA 
FNM I E VDEL P S LM KI QDHFRKS 


9461 


23362 


.A 


9530 


437 


240 


QQVS SARPEP E AGE VS P P VGAGVNSNS W 
AFKYGPGNPKQSGPEPKKQTQVSFLLRR 
KGEASQPRQ 


9462 


23363 


A 


9531 


413 


260 


LFDQWKVASPSGLRTSTVRYGDPEKLK 
DEESLRQQAEKEGSALSVRISNV 


9463 


23364 


A 


9532 


431 


260 


ASSGQMFSVKQQAPPVRDLPPFGISLFG 
GQIYDNISPRPPQGIVAPPGGRGNIPNL 
G 


9464 


23365 


A 


9533 


2 


134 


IHGVCVHMCVCACVCVYACVCVGGYDGR 
I FLALS STVP SMVNIM 


9465 


23366 


A 


9534 


419 


53 


LLLLTNGDRQTQREKIEACACQSYFDAV 
WGGEQREEKPAPS IFYYCCNLLGVQPG 
DCVMVGDTLETD I QGGLNAGLKATVW IN 
KNGIVPLKSSPVPHYMVSSVLEIjPALLQ 
SIDCKVRMST 


9466 


23367 


A 


9535 


439 


109 


NCIVI IDSTPYRQWYESHYALPLGRKKG 
AKVTPEE EE I L YKNRSKKI QKKYDERKK 
NAKI S SLVEEQFQQGKVLAC I ASRPGQC 
APADAYVIEGKELEFSLRKIKARKGK 


9467 


23368 


A 


9536 


1 


185 


NTTSSHHFTIPCIjVLYSCTSTHTHIHTH 
THTHTHTHVRVFSYIPSQDVKEGCIPAP 
SVFSW 


9468 


23369 


A 


9537 


168 


280 


NPNSIRLTSCCCCCCCCCWKSPYNVYNR 
LbRTGFTFP 


9469 


23370 


A 


9538 


142 


324 


KIVKMGFPHAGQAGLELLTSGDTPGSTS 
QSAGITGVSHRSRARSIFLYPYLPSVYL 
PWRRV 


9470 


23371 


A 


9539 


3 


139 


YIGGMGQAWWLMPGKPRQNDYLSPGVQD 
QPGQHSETPSLQKGFKN 


9471 


23372 


A 


9540 


383 


318 


MSTQNNIQSYPIGEDEESESD 


9472 


23373 


A 


9541 


372 


207 


YGLNMCRQCFRQYAKDIGFIKVGVCRRT 
P CCLGREGCGG YF FRERQTDRS SLEW 


9473 


23374 


A 


9542 


2 


360 


ARGDDDHRSSDRKDGVLEEQIERLQEKV 
ESAHSEQKNLFLVI FQRVIMILTEHLVR 
CETDGTSVTjTPWYKNCIERLQQIFLQHH 
Q I IQQ YMVTLENLL FTAELDPH 1 LAVFQ 
QFCALQA 


9474 


23375 


A 


9543 


379 


75 


TNLQQFAMSHGSAQVKGHGKKVADALTN 
AVAHVDDMPNTL S AL S DLHAHKLRVDP V 
NFKLISHCLLVTLAAHLPAEFTPAVHAS 
LDEFLASVSTVLTSKYR 


9475 


23376 


A 


9544 


380 


56 


RDLSMVRMKSMFAIGFCFTALMGMFNS I 
FDGRWAKLPFTPLSYIQGLSHRNIjLGD 
DTTDCSF I FLYILCTMS IRQNIQKILGL 
APSRAATKQAGGFLiGPPPPSGKFS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KMLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9476 


23377 


A 


9545 


304 


92 


S KS RVTNNTLRS TRKDTMI TLNTHS LIiA 
NT VP I AI I VFTACEAAVGIxALLVS I SNP 
YGLDYVHNLNLLQC 


9477 


23378 


A 


9546 


394 


108 


KE ID F SLVDVTADDAGNYS CMYYQTKS P 
F WAS EPS DQLE I L VTVP PGTTS SNYS LG 
NFVRLGLAAVIWIMGAFLVEAWYSRNV 
SPGESEAFKPE 


9478 


23379 


A 


9547 


1 


335 


GNAVGLNLHR INFDKYHPGYFEKDGMKH 
YHLHRNQSFCPTVNLDKLRTLFSEQTRL 
NAAKNKTGTAPI IDAARSVYYKVLGKGK 
LPKQPVIVKAKFFGSLIFCFRIWILID 


9479 


23380 


A 


9548 


326 


188 


NFLRMGTVAHASNPS ILGGLGGRI AGAQ 
E FETS LGNITRPCL YFKK 


9480 


23381 


A 


9549 


381 


152 


VTL ISLS LLLQWI GVLLWI ARLNLNE 
VEKQWRLNQLNNAAT I LVF FT WINVF I 
TAFGAHKTGFLAARASRNPL 


9481 


23382 


A 


9550 


170 


65 


I HRSKNKWKFHLKDGIMNLNGRDYIFSK 
AIGDAEW 


9482 


23383 


A 


9551 


544 


361 


RRLRQENRLNPEGGGCSKPRSCHCTPAW 
ATE QD S S S KE KKE R KNL KGNT VKS VP L P 
IjTSHA 


9483 


23384 


A 


9552 


369 


181 


GVLGAT S GDMQ I HT FGSM I IGS ITGMVS 
ELGYKFLTVSTADVMIQTLNLHLFSHMS 
HQQKKSY 


9484 


23385 

- 


A 


9553 


1 


516 


S LVHWE FGQAI ARRLKP FGVQRFLYTG 
RQPRPEEAAEFQAEFVSTPEIiAAQSDFI 
VVACSLTPATEGLCNKDFFQKMKETAVF 
INI SRGDWNQDDLYQALASGKI AAAGL 
DVTSPEPLPTNHPLLTLKNCVILPHIGS 
ATHRTRNTMSLIiAANNLLAGLRGEPMPS 
ELKL 


9485 


23386 


A 


9554 


386 


56 


Ij FP AC VLRWQ FAS DGGD I G FGVFLKT KM 
GEQQS AREMTE VLP SQRYNAHMVPEDGS 
LTCLQAGVYVLRFDNTYSRMHAKKLSYT 
VEVLLPDKASEETLQSLKAMRPSPTQ 


9486 


23387 


A 


9555 


375 


150 


I WGKTS FHVS I PLSLFFPGDVLYDLLQ 
HILKQSKPRILFS P FFHPGNS IHTQPE V 
I LHQTHEEGTGRGFS FLA 


9487 


23388 


A 


9556 


3 


354 


CLP TQASAMVDTP P PALLP PCS L I SDCC 
ASNEQGSVGIGPSEPGAGYNFLVCRLLR 
PLEKHSIRVGVTQFSRCCLSPPSLTRKG 
NSLTPCTSRVRRCLALLQLTLSEIiPHCP 
VPSV 


9488 


23389 


A 


9557 


463 


40 


PIKVEDPEPVKKPPKEQRSIKEMPFITC 
D E FNG VP S YM KS RLT YNQ IND V I KE INK 
AVI S K Y KI LHQ P KKS MNS VTRNL YHR F I 
DEETKDTKGRYFIVEADIKEFTTLKADK 
KFHVLLNILRHCRRLSEVRGGGLTRYVI 
T 


9489 


23390 


A 


9558 


338 


104 


NFKKKKKRRTKKKE I RTEEKGRGANKKT 
FRG P KKKE TRG S NRS P EGKKNHGAKKE G 
ERRPRKKKKKKKKLARYDFTCL 


9490 


23391 


A 


9559 


1 


152 


NTCVCVCVYTGFCHVAQVAHELSSSNLP 
S S AS Q SAG I TG I SHHAWPGFGF 


9491 


23392 


A 


9560 


517 


238 


SGPGESVPHPWSAVRTRPAKKPGSQCLL 
Q S AF P LWC S SRRT T WAEERKLNTE T FG V 
SGRFLRGRS SRGGFRGGRGNGTPRRNPT 
SHRAGTGRV 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=Phenylalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, H=Histidine, I=Isoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, JD=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


9492 


23393 


A 


9561 


709 


280 


NTS PALTAP CGR P RG I R P MQ C WARVP AP 
SSACPAWQLPSLCLRSTVPAPHPSPAAH 
QPCELSQHHRGGLEPALPGTRQRLCSRT 
WHRCVRFLPGAQEAGELGPAACATGRLP 
VSHSTDMEQAPGLTSLHQEPHSTARGGT 
ATH 


9493 


23394 


A 


9562 


3 


130 


YRGFCHVGQAGLELLTSGDRITSASQSA 
EITGVSHRAQPQMS 


9494 


23395 


A 


9563 


409 


160 


NPGVKRS P CLSLS S SGGYWSMP PCLAYY 
KKKKRRFLKDGGLTMIiPRLASMSCAQQF 
SHFGLPQSWDYRHEPLHIAQFFLFSLL 


9495 


23396 


A 


9564 


1 


241 


GICLGSNIGSGWMGTSSLCNPYSNTLNQ 
NLLSTTKPFPVPSVGTNCGIEPWDSGVT 
S GLGKML FLYYYYL LLtF SQMTPQS 


9496 


23397 


A 


9565 


2 


215 


IHWMNGWMDGWMDGWMGSCVRAVTGPGT 
S SPHTSTCGHAVCVGVCVCVYTCAHVFL 
ALFGVEKNNYFLWKA 


9497 


23398 


A 


9566 


244 


117 


DHQHLNWS VF ISFS FKVI KARKNKKG I L 
NPDSSMETSPDFFF 


9498 


23399 


A 


9567 


414 


77 


FGGVPPGTLVPPTWLPPVRVPPPPAFCL 
GELSPPPFFWRKGLAPPLLNFFFSPGQR 
GFPSPGGKGPRPLPRPAGFPGFVAKPPF 
RPPGFSSTPSDCGGLPFFFFFFFKKKVF 


9499 


23400 


A 


9568 


418 


198 


PPVSLTEVIVSKSHKAELVFWNMPGTLE 
HPEGDNNYMEFIEVLTEALERVLLVRGG 
GSEVITIYSYPTLNARA 


9500 


23401 


A 


9569 


549 


415 


RQENRLNPGGRGCSEPRSCCCIAAWATE 
GDCLKKINNNKPKNEQK 


9501 


23402 


A 


9570 


546 


1 


KLECRGAITAHCSLNLPGPGDPPTSATQ 
AAGTTGTCH 


9502 


23403 


A 


9571 


415 


215 


SRLRKKIYIYIYMCVCVYVYICVYICIY 
VCVYICI YVCI CVYI CIYVYI CVYMYI Y 
MHEFITFFSI 


9503 


23404 


A 


9572 


374 


138 


IiAASAAIQELFKRISEQFTPMFRRKAFL 
HW YTGEGMDEME FAE AESNMNGLVS E YQ 
Q YQDAT AE E EGE FEE E AEE EVA 


9504 


23405 


A 


9573 


370 


233 


GSPPVRSVHQSGVSLSGTQVDEGVRSGS 
KRMVAP PGGRYN I T S L S 


9505 


23406 


A 


9574 


3 


399 


YSSPGPIALALRDPCAHGGPSPNSSLSP 
AQGHPARAQQGPQPQQGPPSHLGSDLRP 
HVAHTRGALHGPAPAPASPAPGGSGSGV 
AASMCSAPRPGRAGGTLGSVLTAHHPAS 
AGCAHRDAYVC KAP GRAVF 


9506 


23407 


A 


9575 


314 


48 


SILPHSDPERCFSIQPEEGTIHTAAPLD 
RE ARAWHNLTVLATELDS SAQAS RVQVA 
IQTLDENDNAPQLAEPYDTFVCDSAAPG 
QVSN 


9507 


23408 


A 


9576 


505 


264 


I F WEMEFHHFGQAGLRLLTSGDMPASA 
SQSAGVTDMSHCTRPDTPFNFKELPGAV 
VTFLL CGMQ VGRDELEDDWMGAWS 


9508 


23409 


A 


9577 


508 


314 


PATWKGCLSNFSSYIiTKPADFIEHQVLS 
WEQVPDGFFI FNERFKS FTVWLNWVAE 
FVCKYKLL 


9509 


23410 


A 


9578 


512 


1 


RMREAI FS PGRLPRGAAPLRGPAGTLVP 
WMPTPGECWL.SLSVSAAGEKPYKCSVCE 
SAFNRKDKLKRHMLIHEPFKKYKCPFST 
HTGCSKEFNRPDKLKAHILSHSGMKLHK 
CALCS KS FSRRAHIiAEHQRAHTGNYKFR 



1119 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K—Lysine, L=Leucine, M=Meth ion ine, 
N=Asparagine, P=Proline, 
Q-Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibie 
nucleotide deletion, V=possible 
nucleotide insertion 














CAGCAKGFSRHKYIiKDHRCRLGPQKDKD 
PV 


9510 


23411 


A 


9579 


3 


487 


WDL I YVGRKRMQVEHPEKAVPRVRNLVE 
ADYSYWTLAYVISLQGARKLLAAEPLSK 
MLPVDE FLPVMFDKHP VSE YKAHFSLRN 
LHAFS VE P LL I YP TH YTGDD G YVS DT E T 
SVWNNEHVKTDWDRAKSQKMREQQALS 
REAKNSDVLQSPLDSAARDEL 


9511 


23412 


A 


9580 


1 


840 


S KYLYDEGEEEEVFNDEWGAAGGVAVPT 
LNALERGFLSAMDWHL YTDPRE I FEVLS 
WLESCVAEQQGRWRGWYTYTDLCVLLEQ 
P TWQLALGS LCQRL VKDSCLLAVAYVS S 
VALAVASVAVIHQSLGLSCTPTPGPPDL 
GLTSRCLLEPCIPSVPQCLPSIiANVSSC 
LEGSMGLRSLWGSLLASLTPPPLPPPDP 
PAP P TLLHNCHL CQ KLQRD S P T CHACLH 
PNRTVPTALSSPWYHTYGLAPPWPWSPV 
PLSLPQPQQCSLFS VMELARLKS FVFPG 


9512 


23413 


A 


9581 


2 


158 


IHAGFISVSVLDNIIYICMCVCVCVCVC 
AHMCACISEHITHAPRLPEVFYTP 


9513 


23414 


A 


9582 


3 


144 


YTAGVISVSVLDNI I YICMCVCVCVCVC 
AHMSACFSEHITHAPMLP 


9514 


23415 


A 


9583 


503 


3 


GKCVCLSAVWGSLLSAFRGSLSVCCLGQ 
FVCLSAVRGSFSVCCPGQRVCLSAVRGS 
VSVCLSAVQGNVSVCLSAVRGSLFVCLL 
SGAVCLSFCLLSGAVCHLCRAPTPCLSR 
S LAHRRCS VNVCEMTQL I S VS YFFCWGW 
WRHRNNIHLQKCRLRILMAYTGQGPP 


9515 


23416 


A 


9584 


2 


1358 


AGLDS LHKFQVKI EKEKVYVRAS KQALQ 
LQRRTKVMAKC ISP SAGYSSS TNVLI VG 
AETLRQEGFSDRIVLCTLDRHLSYDRPK 
LS KS LDTQPEQLALRPKE FFRAYG I EVL 
TEAQWTVDVRTKKWFKDGFKLEYSKIi 
LLAPGS S P KTLS CKGKE VENVFT I RTPE 
DANRWRLARGRNVWVGAGFLGMEVAA 
YLTEKAHSVSWELEETPFRRFLGERVG 
RALMKMFENNRVKFYMQTEVSELRGQEG 
KItKEWLKS S KWRADVCWG IGAVPAT 
GFLRQSGIGLDSRGFIPVNKMMQTWPG 
VFAAGDAVT F PLAWRNNRKVN I PHWQMA 
HAQGR VAAQNMLAQE AE M S T VP YLWTAM 
FGKSLRYAGYGEGFDDVI IQGDLEELKF 
VAFYTKGDEVIAVASMNYDPIVSKVAEV 
IiASGRAIRKREVELFVLHSKTGDMSWLT 
GKGS 


9516 


23417 


A 


9585 


360 


239 


Q CRKGLGKLVSFGGLGI WSLIDVLL IGV 
GYVGPVHGSSYV 


9517 


23418 


A 


9586 


160 


1156 


MPRLT FAP KGWPHP P TS LHPGQ VTDQTT 
WWLFQELPTPSENSMPPGLSTPTASQEG 
AGPVPDPSQPTRRQIRLSSPERQRLSSL 
NLT PDPE ME P P PKP PRS CS ALARHALES 
S FVGWGLPVQS PQALVAME KEEKE S P FS 
SEEEEEDVPLDSDVEQALQTFAKTSGTM 
NNYPTWRRTLLRRAKEEEMKRFCKAQTI 
QRRLNE I EAALRELEAEGVKLEIALRRQ 
SSSPEQQKKLWVGQLLQLVDKKNSLVAE 
EAELMITVQELNLEEKQWQLDQELRGYM 
NREENLKTAADRQAEDQVLRKLVDLVNQ 
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M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, Y=possible 
nucleotide insertion 














RDAL I RFQE ERRLS ELALGTGAQG 


9518 


23419 


A 


9587 


419 


130 


RRDPHLPRIMGRTRLNRKNAETRINAQL 
PLTDKGRMARHVLDNSGEWSVTKRQVIL 
LHTELERSLEYLPLRCGVLTGLiAAIASL 
liYMLTHYLLPYA 


9519 


23420 


A 


9588 


2 


399 


FS CE FNMEANQCPLVVE PS YPDLVINVG 
EVTLGEENRKKLQKIQRDQEKERVMRAA 
CALLNSGGGVIRMAKKVEHPVEMGLDLE 
QS LREL I QS S DLQAFFETKQQGRCF Y I F 
VKSWSSGPFPEDRSVKPRHV 


9520 


23421 


A 


9589 


19 


205 


LYCLVEWHDLCSLQPLSPGFRQFSCLNL 
LSSWDYRHMPPRPTIKLSSYSSIKLIYY 
IiFMLVL 


9521 


23422 


A 


9590 


500 


407 


KLTNE VRS VKRS QLFPES FMCKEKLGNT 
Nil 


9522 


23423 


A 


9591 


727 


236 


RRSRGLEGREALALCPGDGSHIiLCRRTD 
S S F S FMAFF F T FMAQL VI S 1 1 QAVG IPG 
WGVCGW I AT I S FFGTNI GSAWML I PT V 
MFTVMAVFS F IALSMVHKFYRGSGGS FS 
KAQE E WTTGAW KN P HVQQAAQNAAMGAA 
QGAMNQPQTQYSATPNYTYSNEM 


9523 


23424 


A 


9592 


381 


278 


HPSVYKVASGLKEGLSLFGILNRCHCKW 
GEKLLR 


9524 


23425 


A 


9593 


412 


156 


SQRCIiSLGCHEHLAMAYAI I ICDNGFSA 
LLGP PLPGW I YD I AQKYDFS FY I CGLL Y 
MIGILFLLIQPCIRIIEQSRRKYMDGAH 
V 


9525 


23426 


A 


9594 


263 


50 


KHAAP PAS LS LS LLLHHGQ KRACFP FAF 
CRDCQLLEGSPAMIiPVQPAKLIiVLLEQV 
CLLCALLIPPSGSRI 


9526 


23427 


A 


9595 


432 


309 


GTFSERGPPLEPRSQTVTVDFCQEMTDK 
CTTDEQPRKDYT 


9527 


23428 


A 


9596 


457 


29 


TLLPGWITAQASEGEIGASLPSSLPPLL 
YLPSWRCSPSGALTHPAACPTGQRSSSA 
LLPSLPQAPPGFCGMTGLRPLPSPSVGC 
CSAPSPQLLCPTQIiPPAPESSEGGCSES 
RCVANVKYTRDLGDFLEKCTNGEASALE 
YP 


9528 


23429 


A 


9597 


864 


407 


PSRRLSPTGRTTHGSSAACAPPPPRLER 
RSRTSRPPWGPPHSASARTQRACTTWYS 
RSWACSCSWITSRPSWMTTKRTCMIFST 
I PTTPRTALWRGLSLWKDAWLLTRLKLA 
PSSPTSMPPTTTCTACSSTWMTCGSPAR 
WASTPMPRSSTT 


9529 


23430 


A 


9598 


421 


145 


L TQHNGDAAAS LT VAEQ YVS AFS KLAKD 
PNT I L L P S NP GD VT S MVAQAMG VYGAL T 
KAPVPGTPDSLSSGSSRDVQGTDASLDE 
ELDRVKMS 


9530 


23431 


A 


9599 


1013 


604 


PGR P TRP D I CLLL E RMEE Q VKNVMKT FR 
EELYNIEKAFEVERQELIASNKKKWEQA 
LQAIINAKELEYLNNRMKKVEDYEKQLNR 
QRIWDCEEYNMIKIKLEQDVQILEQQLQ 
QRKAIYQLNQEKLEYNLAGAEEER 


9531 


23432 


A 


9600 


409 


191 


HHVGQAGLELMTSSDLPSLASQGAGITG 
VSHCARTVYGLS I VFFSSHRGFASVRTP 
P SNENFPDRP I S I AHP 


9532 


23433 


A 


9601 


367 


1 


P PRYS PPLGGVRGGVP PGPG VLAP PFPK 
GKPPFFFKNPNLPRPGGGAPYSPFFGGV 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














GRGIS FNPGGQGS PGQKKPPGP PRRGTK 
GFFSKKKKKESTKLPVTSHTTHCWFLPE 
ALDLELDPLY 


9533 


23434 


A 


9602 


1722 


394 


GWNGSWNDNLVDTSPLKRDPLQDICRRY 
MEDLKKI CFYRELNSKTTLKFVHTS FHG 
VGHDYVQLAFKVFGFKPPIPVPEQKDPD 
PDFSTVKCPNPEEGESVLELSLRLAEKE 
NARWLATDPDADRLAAAELQENGCWKV 
FTGNELAALFGWWMFDCWKKNKSRNADV 
KNVYMLATTVSSKILKAIALKEGFHFEE 
TLPGFKWIGSRI IDLLENGKEVLFAFEE 
S IGFLCGTS VLDKDGVS AAVWAEMAS Y 
LE TMN I TLKQ Q L VKVYE KYG YH I S KTS Y 
FLCYEPPTIKSIFERLRNFDSPKEYPKF 
CGT FA I LHVRDVTTG YDS SQPNKKS VLP 
VSKNSQMITFTFQNGCVATLRTSGTEPK 
IKYYAEMCASPDQSDTALLEEELiKKLID 
ALIENFLQPSKNGTGSGRSCLGVPPNTV 
MTLCGAYGNRATRRNCHTLEPCG 


9534 


23435 


A 


9603 


14 


356 


DFVERTQYTHTQTHTHTHSHAVRLSPRP 
VQSRPEASQTGPTRTKPALGPAHPSPRG 
AP C P DD PGS LRMLG YGS RAS HNS RRGRW 
GHGRCKDQRAMG PHSHMWEAAL P PGALS 
SP 


9535 


23436 


A 


9604 


477 


264 


LVETGFLCVGQAGLELLTSGDPPTSASQ 
SAGITGVSHRTWAAIiSTFTVLCNYHYLF 
LEVFHHPKLKLS PH 


9536 


23437 


A 


9605 


1 


378 


EGINFSDNLRRGWMVGMPFPNIRSAEL 
QEKMAYLDQTLPRAPGQAPPGKALVEWL 
CMKAVNQ S I GRA I RHQKD FAS I VLLD QR 
YARP P VLAKL PAW I RARVE VKAT FG PA I 
AAVQKFHREKSASS 


9537 


23438 


A 


9606 


417 


254 


MVSLTQELCPVAMRVAEGHNKMLSNVAE 
RVTVPRNFIRGALLEQAGQDIQNKLE 


9538 


23439 


A 


9607 


404 


272 


PPKFLLESGPIRWDWWSLGALMYDMLT 
GAVGAQLKAACI IGLC 


9539 


23440 


A 


9608 


409 


205 


HMNGSLGSGDIDGLPKNSLNNISGISNP 
PGTPRDDGELGGNFLHSFQNDNYSPSSL 
PCSWSLFYSLPF 


9540 


23441 


A 


9609 


449 


263 


VWICPCDPPASASQSAGITGLSHHLPKC 
WDYRLEPPRPAHCFCFYSFTMRNRNLLS 
LVKYSR 


9541 


23442 


A 


9610 


366 


75 


T PGLKRS S CLS IMS RWGYR I E KKRTI I R 
SWEAIKEQDGREVDWEYFYGLIiFTSEN 
LNLVHI VCHKKTTHKLTCDSSS I YYPQT 
RLKRKQ P VRKRQ 


9542 


23443 


A 


9611 


1 


801 


PGYCGSWVFTCGALRQLSGGRDIiSRGAR 
MGNSALRAHVETAQKTGVFQLKDRGLTE 
FPADLQKLTSNLRTIDLSNNKIESLPPL 
L X GKFTLLKSLS LNNNKLT VLPDE I CNL 
KKLETLSLNNNHLRELPSTFGQLSALKT 
LSLSGNQLGALPPQLCSLRHLDVMDLSK 
NQIRSIPDSVGELQVIELNLNQNQISQI 
SVKISCCPRLKILRLEENCLELSMLPQS 
I LiSDSQ I CLIiAVEGNLFE I KKLRELEGY 
DKYMERFTATKKKFA 


9543 


23444 


A 


9612 


179 


329 


KGLAFCPPPGRGGGPPPPLWPPPPSYKK 
IFGLKTPKSGALSPPPPPPGNF 
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SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=PhenyIalanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, H=Histidine, I=Isoleucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


9544 


23445 


A 


9613 


410 


298 


PRFTPFSCLSLRSSWDYRCPPPRPANFL 
SLSSVLHGT 


9545 


23446 


A 


9614 


1 


130 


NPGSHCVAQAGLQLLSSGNPSASASQSA 
RITGVSHCAWPNTTI 


9546 


23447 


A 


9615 


1 


146 


NTWRLHMVPRLVSNFWAPGIKLSWPPKI 
LGLQKRATTMPGPE I YTS IN 


9547 


23448 


A 


9616 


552 


26 


RLS I RNLLTHAF FAEDTGLRVEIiAEEDD 
CSNS SLAI RLW VEDP KKLKGKRKDNE AI 
EFSFNLETDTPEEVAYEMVKSGFFHESD 
CKAVAKS IRDRVTP I KKTREKKPAGCLK 
E RRD S Q C KS MGNVF P Q PQNTAL P LAPAQ 
QTGAE CE ATEVDQH VRQQL I QRE PQQHC 
SSVTGDL 


9548 


23449 


A 


9617 


420 


48 


FSQILRASYEFDSPFWDDISESAKDFIR 
HLLERD PQKRFTCQQALRHLW I SGDTAF 
DRD I LGS VS EQ I RKNFARTHWKRAFNAT 
S FLRH I RKLGQ I P EGE GAS E QGMARHS H 
SGLRAGQPPKW 


9549 


23450 


A 


9618 


190 


285 


RQGLTMLPWLVSNSWPQWLLPWPLKVL 
GLQV 


9550 


23451 


A 


9619 


392 


308 


SPIKPKIPLSAPRKNTOSVKYRLKFRFG 


9551 


23452 


A 


9620 


166 


23 


FCNPIPSSLPSSLPSFISPFLPCFLTPF 
LPCFLPPFLPFLSFMTLTF 


9552 


23453 


A 


9621 


49 


374 


DRRG I R I MAAALF VP LGFS LLGTHGPS G 
AAGTVFTTA* YLGS KI LLTCS LNDSATQ 
VTGHRWL KGRWL KE D S \ L P GRKPE FNV 
GPHPPNGDKSSFVFPPPSPCPRLTF 


9553 


23454 


A 


9622 


85 


472 


SHVFPPLRLTLTLHSMAVPETRPNHTIY 
INNLNDKIKKDELKKSLYAIFSQFGHNL 
D I LVSR I LNMMGQAF VI FKE VS SATNS L 
RSMQGFPFYDKPMRIQYAKTYSYI IAKN 
KGTFLS C/ DRKL I K* TPQ 


9554 


23455 


A 


9623 


170 


2 


IFFFFCFERGSCSVA*AGVQWRSYSLiL* 
S*TPG\SSNTASATSVAGTTGL\HHHAE 


9555 


23456 


A 


9624 


2 


469 


RRLCSDRGLQRSLGSMSAAVTAGKLARA 
PADPGKAGVPGVAAPGAPAAAPPAKEIP 
EVLEDPRSRRRYVRGRFLGKGGFAKCFE 
I LDADTKEEFAGKI VPKSLLLKP YHR* K 
MFMEIFIHRSLGHQHVL*FHGYF*EHDF 
MSEELEL\CRPRSFLF 


9556 


23457 


A 


9625 


661 


1393 


ASPPPSQTTPAAPGCSAGEVSLA\GLCL 
QERRLVDVAGFS I F I PSRYLDHPQPSKA 
EQDAS I PPGTHEALLQTAIiS P P P P PTRP 
VS P PQ KAKE APNTQAQ P I S DD E AS / RWG 
RNPG*CS*SGSPHQWPLPHLP*GRL*A* 
PSPCCPG*VG/PVHSPQARRRPITTSIS 
QRSWKFWPKPTVFRETSGGPWAMPRPSM 
PSRAS I SLSPRTRSRKQPRPLTLGCCVW 
HWHTDGERATCGD VRRAHHS PR 


9557 


23458 


A 


9626 


1 


286 


PTMAKLIALTLMGMELALFWNHQASYQT 
RLNALRE VQ PA* LPNCNLVKGF * TGS ED 
F\EILPNGLTFISSGLENPWI *RLDP\N 
SPGKMLLMDLNE 


9558 


23459 


A 


9627 


173 


491 


E G P LP LE S S SNWGQAD LD KKS RE LLWKT 
VYLL*LNIQLPYNPVNPILGINPRETHT 
C\HQEMCTRIFIATLFTKAKA*K*PRCP 
*AREW/IK* / 1 WNIHI IEHY 


9559 


23460 


A 


9628 


351 


1 


VLGDAIQSRRGSSRKAGSYLLSRSP/CS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Pred ict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 














E GGRGG CWAWNLGTMS S VAS GHGAW VP R 
ELTS * AWPEPGPAGQREGRPPPTPCPPT 
CAHGPPAQP * PRGKAAYPG/ CLGLQWPD 
*SAFNP 


9560 


23461 


A 


9629 


599 


11 


KMWR* *H*AKSQIRNAIPFTIAHK/RDL 
KS LKI QLTKEAKD I YNENYKI PL/NEIR 
DVTNKWKNI P\ CS * I GYLYS T YKYKCI C 
LVNIYL 


9561 


23462 


A 


9630 


226 


1 


KCDNLGYSTFFFFFFPFFFFEIGSYFVA 
QVGV* WHNYSNS /DASAFQVAGTTGVCH 
HAQLI FNLFVEVGSHYVAQA 


9562 


23463 


A 


9631 


241 


1 


VSGCPLPRG*RPPPQPQSPPPSTPGTSD 
HPCTP CR/ PALRR* S P * *LPCLDLPPAS 
GP PSTSGNS TGTMQPT * PKLNS S SS 


9563 


23464 


A 


9632 


201 


1027 


MTLTPAPGQGQQSSQCL.WRRPL/PLPPG 
PDSEEPGSGAAG/PGSSL*SPDLPPAAP 
PKP*TSS*EMNTVGSTQEGSGLAQRTTQ 
HCPVPPST/RVDILGTCLQPFPALCLTS 
APWLVRMPPSQRSSREIP*RLEAF/SLR 
T P I PAH S AQ P SHQE KE P / P S PR * DHGD * 
PPP*PPKPTHQTIiAGQAPRSGQGLARPT 
LPYPPSPLRVTPRICRVHV*GHLRRPRG 
PPPSSARRPTWDWRVGLPWEDLEWLCR 
GPAWLLARTSAPFSSLPTHACTGKKTTM 
PF 


9564 


23465 


A 


9633 


103 


379 


YHCITIIVLKCIFFFFFLKPCLILSPRL 
EARGI I LGH * NLCLT \ SQLLRRL I RKS P 
LTLEGGVCSDPRS PHCS PAWPTKGNFFS 
KGKKKILD 


9565 


23466 


A 


9634 


155 


405 


VP AFRS VTN I S I S VS FFFFETEPG F VT Q 
AEMQGLDIGNGQPPCPKLSSFA*ASQVM 
GTTGPRH/HC* AL I FLVKTGFPHVAKTW 
F 


9566 


23467 


A 


9635 


4 


423 


GAMRGDRCRGRGGRFGSRGGPRGRFRPF 
VPHI PLDFYLCEMAFPRVKPAPDETSFS 
EALLNRNQDLDPNSGEQASILSLVTKMT 
YE I DNLT VAPGTLD VL I EERQQGGP YI K 
GTMTT*RH/ SVADLTVILQILTTWESVA 
S 


9567 


23468 


A 


9637 


2 


492 


GSRAIGFGLAHELERHLIKIWLAS IVR* 
DIiA*LMNWSG\SHKDLAGKYRPILEKAI 
QLSGSEQLQAFKAFEESMVNENVSLVIS 
RQMLTDFCTHLPNLPDSTVKEIYHFTLE 
KIQPIVIS FEEQVAS IRQHFAS I YE KEE 
DWRNAAQVLVGIPL* TGQKQYNVDYKLE 
TYLKIARLYLEDDDPVPAY 


9568 


23469 


A 


9638 


1 


417 


WQGLGLVMGSPSRRLQTKPVITCFKSV 
LLIYTFIFWITGVILLADGIWGKVSLiEN 
YFSLLKDKATIVPFl\LIASGSAIILFG 
ILLCIFSV*HYPV*LLLVTFLI *LLVFL 
NKWWISPHFLILTFNYLLFDIIIDNSS 


9569 


23470 


A 


9639 


125 


451 


IYRKPLKSIRRVGGLHLNKIFFFFLKQI 
WGWARGLMP C * HFGRLKKGDCLTLG I GD 
/SRG/RYDHDPALQPRQQSKTLSLKINK 
I GWAQWLMPG I PAPWEG * AGRS FGAGI 


9570 


23471 


A 


9640 


670 


927 


GHVLL P RRAGCAQ PGTGPARAAP AAVQR 
LPGGGVQPSGPGGRAGQGQGLRGT * AGG 
NPGTGF/GGPHASALLPPSEAPGSLLPS 
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09/515,1 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K— Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














QPR 


9571 


23472 


A 


9641 


115 


460 


ISWIFLIICVGFFGGGGGARVPLCHLSP 
RLECSSAIRARCGFDFPGSSDP*ASAS * 
VAGTT/G/ SCHHVQLIF 


9572 


23473 


A 


9642 


1 


431 


AMRAE VL VY C C RGALAS G C VQ AFRAR Y \ 
WVL 1 I VDVPDNEQANAS 1 I VKDTDS FTE 
QADQLTAEVGKLLGE*KVDAILiCVAGGW 
AWGNAKFKSLFKNCDLMRKQS IWTWTIF 
SHLATMHLKEGGLLTLAVAMAVLDGTPG 
MIGD 


9573 


23474 


A 


9643 


154 


408 


I IMNAQS VEEDS IL I I PTPDEEEKI LRV 
KL*EDSDGKYGSKZPWNHLPEPENFRLIi 
FKHVG YLD* SGL Y\ E I LTKF * I LCVLC Y 
Y 


9574 


23475 


A 


9644 


3 


403 


YSQFFGGPKLEDPIRPGVFSQPGEHRGH 
PFQ I F* KVGLGNGARPGFP VLGKVKCKD 
NLKLGGQGCSRVEMHPRTS AWAIERD PV 
LKKKKRQR\EHQCSNEQKHTN 


9575 


23476 


A 


9646 


3 


399 


ARAM VhS PAD KTWG KAA WG KVG&HAG E Y 
GAEALERMFLCFLTTKTYFPHFDLSHGS 
AQ VKGHGKKGAD S LTNAVAHVDDMP S AL 
VALSDLHAHKLEVGPVNCKLLRH*LLVT 
LGG\HFPAWLTPGGIAFLEKF 


9576 


23477 


A 


9647 


22 


407 


APSAWGMCHFTEEDKATITSLWGKV\NV 
E DAGGE TLGRLL VVYP W TQRF S DS FGNL 
S CASAIMGI P KVKAHGKKALTS LGDAI * 
HLDDL*GTFAQLSELHCDKLHVDPENFK 
LLGNVLVT VLAI H FGKE 


9577 


23478 


A 


9648 


243 


2 


Q I ATTTLGPGS EG VGKRDLGS S S RHHS Q 
VESVKH\PRDLAQDA*GHGSAPRPLCPP 
PAQSWHCLRGPCLAPHHPTLiCEPSR 


9578 


23479 


A 


9649 


283 


196 


SYF 1*1 ITTTHSLI \ IS I I PLLFFNQ IN 
NNLFSCSPTFSSDPLTTPLLILTT*LLP 
LTIMASQRHLSSEPLSRKKLYLSILISL 
QISLIITFTATELIIFYIFFETTLIPTL 
AIITR*GKPTPLIQ 


9579 


23480 


A 


9650 


10 


354 


QLRVI TRLTWNN* TSA* SRGVS YP * AH 
\SSDALHL*RELTEQRFIVSVNCAIAHC 
RHQSAS EDHVKLRNE VTE FAQTWDADE S 
AENCDKSLHTLFGDKICTVATLRETYGE 
MAD 


9580 


23481 


A 


9651 


2 


404 


AFGTTKWVTYISVLFLCSSAYSRGVLRL 
DAHKSEAAHRFKDLGEENFKAIiALIAFA 
Q YLQ Q C P F \ E DHVDLVNE VTE FAKT C VA 
D E S AEN WD KI \ LHT L FG Y* L CAVAC FRE 
T \ YGETAAC* AKHE P VEDE \ C FLR 


9581 


23482 


A 


9652 


83 


396 


NI PGVGGGGVP I PLVWKYNI SHTQR*RE 
PYIQSLRSGNWSERMSHMVAR*RQRHHP 
YYI \LLQAQLSMA\ SDPANVMMDSVITA 
GVLVA/ SRRARRI S KLGHRI 


9582 


23483 


A 


9653 


67 


954 


REGNHNTERNCKRP PQDTGPTQ / RPGP P 
PPEVPWQDASS APTPAAAPVGKLLPS * G 
PPGSAS CQGR * TLGSVHWL * GS PLPS PS 
AGGRALPYGPAQGQSQGHWENTGRGTPL 
PGSRET*GHELTTGVEHQRHKAPREPQA 
GQPRLCPWQSLPPT\ARSPSWSQGERP 


9583 


23484 


A 


9654 


1 


252 


RPPQLLASDSLPPSRPPSFPPSLLLSLR 
PPLSRSLCIiSHYLCPPIiSLIQQLTSFPD 
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nucleotide 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TFSPPKSTTSGPPQPEDSPSS*PPTPSP 
P\PPSLLSSLPPSLSSAPPF/YVHSVSP 
TI SAPLYP * YNS * PHFPI PFPPRKVQHL 
ARPSPKTARPPWTIRRI 


9584 


23485 


A 


9655 


1 


394 


SLRIYCFVIEVICFLYLL/FIMYSIS*G 
I*LLMNFTILCIGY*LLQYFFFVWKSF 
YFNFIRIPFILFLYLICLLVF*LIFFDL 
SYF /MCLRL 1 1 FNLF Y* LFLLYS I CLWN 
MVLLLISKLF* F* FCGRFALIQ 


9585 


23486 


A 


9656 


122 


371 


S P YLLIiLQGGVS W I VWLLNVE STNS S FL 
FPETGSHCAVQGGVQWCGRGSLQS*TPG 
/SQS * VAGITGMCHHAS*LEKFFFVEMG 


9586 


23487 


A 


9657 


425 


3 


KT S MMNS IVPHISIPT LNVNGLNVPL KR 
YRIAE*IKIHQPSICCLQETHPTHKDSH 
KLKVKGWE* IFHANG\QGVAILISDK/T 
DFKATTVK/RDKEGHYII IKGLVQQENV 
TViNLCAPNTGAIiKFIK\ QLLLDLRNE I 
NANT 


9587 


23488 


A 


9658 


179 


373 


VHSWLLWGSQSFVLFRPLTD*MRLTH/I 
MKGNLLESKSTSLSVNLIQKHPHRNIQN 
V/AQNIWVSWNS 


9588 


23489 


A 


9659 


103 


373 


LKHVLRLCLQSERKLHYLFNIDQWGKFY 
TWDDLYECISSKFTTHFFFSFL/FFSF* 
SQGLALSPRLDCSDAI IAHCILEFLGSS 
HAPTSAS 


9589 


23490 


A 


9660 


216 


424 


TDINVTTKTIKPLDENIGITLYDLGLGS 
GFLDMT * T/ AQVANEKI G * WDFIRNLKL 
TCIGHYHEVKKTNPW 


9590 


23491 


A 


9661 


276 


2 


PRIGKLPTNMIEPIillKEMLINLAMRFH 
LS PVRMAI / 1 KI KKSNRCW* GCREKGI \ 
IYHYWWECKLVQPL/WKSSLK/FIK*LP 
IDLSFVPAIPLL 


9591 


23492 


A 


9662 


239 


356 


I C I KRRK/ WS GTVAHTCNP S S LGGQGGR 
IA*VQEFQPGQQ 


9592 


23493 


A 


9663 


1 


340 


RHE LMTPHAFGAMKRVT F I S LL FLFNLA 
YSRGVI SRDAHRSEDAHP I KDLV*ET FK 
ALVM I AFAQHLQQRPFEDHV* LLNE \ VT 
E FAKT C VAE E S AENCD KS LHTL FGDK * C 
TV 


9593 


23494 


A 


9664 


759 


1039 


KR I P FGRPRRVDHLRSGVRDQ PGLHDET 
PS\LKIQKLAGQGGICL*SQLLRRLR*E 
NHLNPGSRGCSEPRSCHCTPAWATE*DS 
ISKKKNTKIS 


9594 


23495 


A 


9665 


1 


355 


ITDLYSMFHFG*VDLATITSLWGKGNEQ 
KAGR* TLERLLDVYPWTHRFFDS FGNLT 
S ASA I MGNP I VKAHG KKVLT S L GDA I MH 
LDD/LKGTFAQLSELHCDTLHVDPENFH 
ENPCDGS 


9595 


23496 


A 


9666 


364 


3 


APNIPPSVTRIPALLAFRSQCLCGSCPL 
PLATEPVGLLLSMSFQAAPCFTWWVTEA 
WLTGISWPPDFLISLLCI/HINYNAVIF 
RKYRPRVGAVAH I CNPSTLGG * GGW I A* 
AQEFETSLGN 


9596 


23497 


A 


9667 


1 


357 


PISNAMRHLGQEETATITSLWGKVNVED 
AG*ETLGRLLVIYPWTHRLFDSFGNLSS 
ASAIMGNP*V\KAHVKKALTSLGDAIKH 
LDDLKGTFAQLIELHCDKLFVDPENFKL 
LGNVLAT 
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Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine ? 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T— Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


9597 


23498 


A 


9668 


24 


369 


APRPDAMGHFTEEDKAT I TSLWGKGNVE 
DAGGETLARLLGVYPWTQRLFDSFGNLS 
SASAIMGNPKVKAHGKKVIiTSLGDAI *H 
LDDLKGT FAQ \ LS E LH* DKLHVDPENFK 
L.LGK 


9598 


23499 


A 


9669 


12 


365 


LLLMDGERK* FLETDSAPCEDAMNTVEM 
TTKNLE * S INLVDKAVA/ SGFERI GFNF 
EKS STLGKMPSNS I ACCRE I FHERKS / S 
MWQTS FLS YFKKLRQA/ P PAATTLNDHQ 
PSALRQDPSP 


9599 


23500 


A 


9670 


346 


2 


YPYDPWSLINNMFFLFICRFQNFFLKEGT 
LTFPFKGFRGDLSLFFWPPYKLFNLKSF 
I*/CKFLEI*RYFFG*IIFFFPFFFFFFIi 
RQSLTLSPRLECNGTISAHCNLCLjPD*S 
NSP 


9600 


23501 


A 


9671 


121 


364 


HKKKTAGRVQWPLLVIPALWEAKEADCL 
SSGVQGQPGQHSKTPSLP*PSKSWDYRC 
TPPHLASFCNFFSFFFGRDGVLLCCPGW 
P*TPELKQSA/CLGLPKCWDY/RARATA 
PGLLFF\FMLKYVLVKPAVNIANAN*KE 
LSEPDVKNEICFV 


9601 


23502 


A 


9672 


437 


223 


EME/ FSLLLPGLECNGTCLAH*NLRLLG 
S SDF PAS AS PVAG I TGVRLHAQLELYFL 
NLLGFFFIISLWYLS 


9602 


23503 


A 


9673 


100 


338 


KSRPSAVANLTPVI PALWEAEAGGSPEL 
KQFSLSLLSSRDHRFATPC/RG*FFIFF 
VEMEFHCVAQCGFELLDSSSPPTSA 


9603 


23504 


A 


9674 


161 


1 


S FLWKLCLRGAPS CMRCQLA\ LL* DVSQ 
LGY/ SGVRDPLEEAVCPFSDLKLRAGR 


9604 


23505 


A 


9675 


304 


1 


VPPLASQLGDKRRIFFQKKKKKKKWKDK 
IHI I IS I ILKKFDKI * YSLI IK\TL*KL 
GME*TYLNI IKVI YDRPTAS I ILSGEKL 
KSFPLKSGR*QECPLL 


9605 


23506 


A 


9676 


111 


346 


SKEGSKEGSLEIWGFIiVLAFPFPFPFPF 
SFFFF\ *RQAGRPRLECRSMITAHCNLE 
LLGSSDPPASAS \QVAGTTGPTSP 


9606 


23507 


A 


9677 


3 


349 


ARAGRI IKDLWLGFFFFFFFFYPKKTPL 
FTRDKGGGKLQTAKKNNPHPVKREKKKP 
PPGGTRGGKNPPGNPFWGFGPQRFFFGG 
KTFGGGAHPPDPTLG/PSPQS*KRPFGK 
RGPPP 


9607 


23508 


A 


9678 


38 


368 


QASLLKESESERKSPTSLTLSQKLGMIR 
LCEEGMSKAKAD*RLGLLH/QVSQLVNT 
KEKLFKEIKSAISVNTLMIRK/RNSLTA 
DMDKWVAWIEDLSSQNNPLSQSLIHSK 
A 


9608 


23509 


A 


9679 


187 


345 


PFILFFETESYSVA*AEMQWCDLGSLQT 
PPPG\SSNPPVSALWEARAGGSRSQ 


9609 


23510 


A 


9680 


192 


468 


ERTSLFTIHLLKLLTITNSAAMSNHV*V 
' FV*T*IFLSLGQMPKSVTAGSNGQSLFR 
FFK\NCQTLFHGGCAMLH/ SHTQLNAI P 
LFRNLWQHLS 


9610 


23511 


A 


9681 


141 


367 


GS WAAS PCS CTMDDFAKES FTWDYVLL 
ENCPNMGD YWAPQ FMTDNYVRVTQLNW 
DGVGTQ* KDYI /SSERNL 


9611 


23512 


A 


9682 


193 


3 


I FGERR* I LKI YKLTGHVGLRL * S Q I LR 
RI /RLNLGGGGCSEPRSCHCTPVWATER 
DSISEKKEK | 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
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09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V= possible 
nucleotide insertion 


9612 


23513 


A 


9683 


1900 


1988 


NLVHFEEHFPSLFTPSPSLCHIF/CSRD 
RVSPY*PGWSRSPDL/RYLP/CLDLPKC ' 
WD YRRE S P CPAS S RFLMNFL Y F * NTTLW 
Q*KTIPLLF*SGDC/S*FF*YNFLAHTS 
S TMVHSGI SRNS RP VFKLADKAPSHS PL 
SIIAFGVEFF/NLGSLQPQPLKLK*ASH 
LNCL S S WDNRHAP P C P DNF F FN I L * RWG 
FPILSRMVLNSW/PLCSLSTSASQSSG/ 
ITGVSHCAGPILVF 


9613 


23514 


A 


9684 


69 


358 


ILIVKVTFPHTQKPNFVLRLSLQS IQQM 
LLPERKWTHAQKTCVSSENTERCHEKQV 
I TSHWLGMVAHS CMPSTLGGGGGWI T * G 
QVFET/SLANTVKP 


9614 


23515 


A 


9685 


66 


344 


IGEKLLLTDKHRNWVLETE*APGKDSVlSr 
IDEMT/TR/DLEYYIN\TDKAVTGFERT 
DFNFKSS IRGQSL* YSII/CAKGSQQKQ 
Q ISLVP YFKKLPQ 


9615 


23516 


A 


9686 


269 


1 


PKIQNGHCKTLPKNPSQSDTKTFPLFTS 
GSSPQPHKLSFCFSAGPTLPSKTQLKIH 
L\WQGTVPHAYNPSTLGSRGGRIT*GQE 
FKTIQ 


9616 


23517 


A 


9687 


229 


361 


DRYYYKSHFKPGAEAHAC*LVLASFFMC 
YRAIMINK*HLYHRKL/VYQVIRH*KYR 
D / 1 * THVTLTTTL * DRYYYKS HFKPGAE 
AHACNPSNLGG*GG* I I*GQEFKTSLAN 
MA 


9617 


23518 


A 


9688 


318 


46 


KL YALNDMASNTCKE YDNL I Q YYRYKI Y 
KRLIQHD*VEFIPEME/NWFNI*/RKSI 
NVIHYIHSLWEEKNYMIISLDVADVSDK 
IKLTFLIF 


9618 


23519 


A 


9689 


168 


3 


GNSNTMIFFTILYLFFLYFF/LFFIFDM 
KSHSVARLECSGVILAHCNV* LPGSSNS 
P 


9619 


23520 


A 


9690 


192 


3 


WHKASLSNPQPAGCMQPRMALHEAQHTF 
VNFLKTLWAG\ P VAHAYfTPNTLRGRGGR 
IT*GQEFK 


9620 


23521 


A 


9691 


388 


1 


CRSAGVCWRSTPDPVCLGITSSGCRTAE 
IVACSFLWKLHSRGAPARCQPELSCMRC 
\ R * T LL * G VS QS GGTG VRD P LKE AVG PL 
AELENGARRSTALFRASRQDSLSLLKLR 
PQLPLLPGALSQIEGI 


9621 


23522 


A 


9692 


226 


3 


WFLLPQVKLAIMTPRLYFPNLPFFFFF* 
DRV/CTVAQAGVQWRDLSSLLLSLLGSS 
S S PTS AS RLTGT 1 1 LQHHAE 


9622 


23523 


A 


9693 


305 


1 


NHAITTVN* FGLIRHLVTKAAFNSGEVD 
IV/SIKDPLIALNYTVYMFHHDSTHGKF 
RGTVK 


9623 


23524 


A 


9694 


320 


1 


P RDP PAS AS QS AG I TG I SHQAW PDM I L Y 
IENPKDSSKNPLGLINKYSKVAGYKINT 
QKS AAFL * TJSIU YLKN * P / MRT I P FT I AT 
KKKYLETYLTMEVKDLYTENYKM 


9624 


23525 


A 


9695 


263 


1 


VRILLFSESLALSPRLECSGVISAHCTL 
CL P GS RGAL S LL S RVD Y / Y* VFLVKAG F 
RHV/GQAGLEFLSSGDPTVPSLPDFWDY 
RCKPR 


9625 


23526 


A 


9696 


262 


359 


IRKIHKIWSGWAHTCNPSTLGGR*TKA 
HEGSLYCLRVGKI VSNKVGTRS * FFRTQ 
K*V/HLFFS*VYHINGSH*SRTSL*IRK 
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peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
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nucleotide 

location 

correspond 
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residue of 

peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=As par agin e, P=Proline, 
Q=G)utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IHKIWSGWAHTCNPSTLGGRVGWIT*G 
Q 


9626 


23527 


A 


9697 


166 


473 


TQDTSQPWKNIWISFGSLRVFENVDIHK 
Li YL E LDNRHR S WMTD I LL F F Y F * DG S FT 
VT \RLE CNGAI L AH CNIiHL P G S SNS PAS 
AS * VAE VGGTLE PRRLRLQ 


9627 


23528 


A 


9698 


456 


59 


LFLWKLHLRGAPGYVRCRLA\LLGGASQ 
LGYSGVRDPLEEAVCPFSDLKLHAGRTT 
TLFKAVRYGHLCLQKFLLPFVRLCPAPT 
GGVFTVRQASMS CDGLLI V* \VSERH* L 
PKPQQFWELVPKFVRPGLKSL 


9628 


23529 


A 


9699 


4 


375 


VFCS S VTI VSE I AFLMWLS AQLLL VYRN 
ASDFCMSSLYPETSLTKVTQGQAL/H*N 
G*LT**EQYWVKTDQKAVI/CQQSTGSS 
GQ S S QARE I KG P GAVAHT CNP S TLGGRG 
RWIT*GQEFETSLA 


9629 


23530 


A 


9700 


239 


361 


RNDTFFF FETE SNS VA\RLQCTGVTIiAH 
*NLCLPGSSNSPA 


9630 


23531 


A 


9701 


169 


1 


I FYWKAI FFETESHS VAYAGVQ* SNLGS 
LQLPSS\GSSDSPISASQEAGTTDAHH 


9631 


23532 


A 


9702 


37 


348 


HAKNFDVSLLPYTKTHEHFVKEDIWIAN 
KHVKRCSATLVTKE I PPQS /TLKWLKH* 
PPC*QGNGRTGSPIP/C/WWECKIVQLW 
KIV*PFPKKLSIYL/PY/DPPILLLGI 


9632 


23533 


A 


9703 


16 


339 


ARLNTFAMAAKC* SERKS / RN/SLTLNH 
KLQM I Kb I EEGMLKAVNGQKLRLL YQ / T 
S QFANAKDELLMD 1 1 SATPATV* Ml G * Q 
NSLIADLEKVRVV*LDDQ/TSHNIHLSH 


9633 


23534 


A 


9704 


276 


2 


GISRGGLSKPPFFP*DFPLDYLGQSPFC 
KVKRNS/RFRTTKSPLDFPIPPLFFFFF 
E KVS Q A I S AHC KLHL P G * CHS P AS ASR V 
AGTTGTRH 


9634 


23535 


A 


9705 


352 


2 


CNSRSDFFQL.MMNCFLRWAVSPRVECS / 
GVITAHCNLNLPDS VDPPTSTS * VAGTT 
GHTP\HTWLIIIIIIVAGIFLFLFFLFS 
FFGRDGVSPCCPSWSQTSEFRQSACLSL 
PKCWDY 


9635 


23536 


A 


9706 


6 


351 


IWNSRPRRPLGGQGGRIQDWPKLHGKTR 
SL/LKKLQKLARRGRASLWSQVLKRLRL 
TWVDCLSLGG*GCSQP*SH/lHCSPAW 


9636 


23537 


A 


9707 


76 


359 


CFLFVCLDFVIFLAYFCN*YLFPFLISH 
TFFFFLKKGLIFAPGVNLRDQNKFYFTL 
FDVFFNI YYC* I FF I F / CLVLLFF IFFF 
FF YCFVMRF I VL 


9637 


23538 


A 


9708 | 


342 


3 


GEELLLMDEQSKWFLE \TPGENATNI VD 
LTTKDLEYSISWDKAAAGFERIGSN*N 
STLELKTVN/SVPHAR* IFCKRLQSVWQ 
TS LLP I LRN\ LP * S PHPLVSAML IIQQS 
STLK 


9638 


23539 


A 


9709 


211 


350 


RQDIALLPRLE*SGTITTH/CDPQLSGS 
S E P PTS AS L VAGTTGACHYV 


9639 


23540 


A 


9710 


253 


3 


VKCFS REHMLIRRS AIGGKKH I E KGTL I 
MSFWWYVSQSEH/NLCNHYAY\SHIKK 
VWLGTVADVYNPSTLGGQGGRIT *GQEF 
E 


9640 


23541 


A 


9711 


80 


324 


LITLTMASMCSNERKSHTSLTSNQKLEM 
SKPTEEGMLKAKTS *KLGLFHQT / SQLV 
KTKKKFLEEIKSTTPLNAPLMRK*NSL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W— Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


9641 


23542 


A 


9712 


246 


3 


QLAKHGLLFFLFFEMGSH* LRLECSL \ T 
ALLSKISAHCNLNLPGP\SDPPTSASQV 
AGTTGACLLAQLIFKNFVVETGSHYVA 


9642 


23543 


A 


9713 


2 


359 


AAASTTIKEQDMIiKFYFLFYFFETESHP 
VTQVGVQWCNLCSIiQPPPP\GPSDSPAW 
VTE QD S VS KE * KRKKL I K I NV YLQ VTTH 
P *HS PQHI VRVLS VLN* LNEKINRRVTK 
SSSKEKDR 


9643 


23544 


A 


9714 


278 


53 


KL FRGLNNHR KF Q T RCGGL F FFLFIYF* 
DGA/YALSLLECSGTIMAHHSLDILGSS 
HPPISASLAA\GWRTCSPSC 


9644 


23545 


A 


9715 


166 


402 


TL S L KKNE NTS RAW WHVP WS G VWKAE A 
R*SLVPRSLRL\HCTPAWA 


9645 


23546 


A 


9716 


168 


1 


VDCII MRRS I AKS PRRKCRGT I S ARCKL 
RLSG\SGHSPASAS *EAGVTGVGHDPR 


9646 


23547 


A 


9717 


1 


553 


TS I RLFFLLS I FFFSRARNR I LFKI YCN 
GVRAKGRGGGWGTCRGHRNTMTWPGPQL 
LLSWGRGMKRQDQGWSWGGRGEGDTGC I 
PPPPGSTSRPWTPRSPWPLRLHLFSASG 
PPGSSSCSP*LFPLTDSQVGRCSGPDPN 
PPYHLMKVQPLPALLLPLLKPHRFPSLG 
Q\PDPPPSSPPSP 


9647 


23548 


A 


9718 


13 


364 


PPNR*RMPLIELTNHSLIYLPTPS\GIS 
A**NFGSLLGACLILQITTGLFLAMHYS 
P IRLQLAFSS I AH I TRDVNYG* I IRYLH 
ANGAS I FF I CL FLH I GRGL Y YGS FLY S Q 
T*NIGIILLLAT 


9648 


23549 


A 


9719 


82 


351 


GEALEPELPMPALSH/HAATLIiAPLPTT 
PFVGTKVTIVGQAQW/HQAL*SQHFERP 
RRVDYLHLFSRNGQNPISTKNTRISWAW 
WRVPVIPA 


9649 


23550 


A 


9720 


179 


365 


FSLFLASLKAQ I IDSKPFFSNTLENI * K 
YEVINFFLYl\lKIiHFMNLTSLCFYCHS 
VCTILL 


9650 


23551 


A 


9721 


2 


320 


PIPPGNESSPPTAQEDMQAANKH/IRRY 
STSLAIRET/QL/KTTTREPYAPITMAT 
TETSAWTTCWREYGESGSYCWVGCKTLR 
PLCKTVWQFLKKPSM*LAYDPAITLF 


9651 


23552 


A 


9722 


1 


344 


PLPTQNWPQGQSFLHYPVLLPAVSVLWS 
SFFFFFWKQILLLPPRLEG/NGQNSG*W 
KFPLPGPSLFSCPSFQTSGNYGPPQQAR 
AI FWKFKI KTGFHGVTREGLNFLTSGSA 
PLGS 


9652 


23553 


A 


9723 


156 


3 


VTEAYFILFYFETESHSV*WHDLGSLKS 
P P PG \ S SNS PAS AS Q VAGTTGALH 


9653 


23554 


A 


9724 


76 


337 


YLSIYLSIYLAIYVSIIYLPIYHPSISI 
YVSSIYSXNHLPLYYFSTHLSSTHSSIY 
LLS ISIL.S INYLLLS I SLS I YLS I * LS I 
YDSS IHLS IHLSHLYI YLI * YLSIYLS I 
YLAIYVS I \TIYLS I IHLSLSMYHLFIL 
SIIYLCIIFLPIY/LSTHSSIYLLSISI 
LSINY/HTSIYLSIYLFVY/HLYIYHPT 


9654 


23555 


A 


9725 


249 


436 


ELAG* I QDH P / GQHHKTLS LQQ I KNLAR 
RGGRHL*SELFRLRQEDHLTPGVRGCSE 
L*SHHSTP 


9655 


23556 


A 


9726 


177 


3 


HLN* * FSNLI FFETES PSVT \RGI IS AH 
RNPRLPGSSDSPTSASRVAGTTDT 


9656 


23557 


A 


9727 


272 


233 


RKNQRIYQIARKRLNEMARISPLRSMII 
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ing to first 

amino acid 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L- Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W— Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














LNVSGLNFPLKRCRLAEWTEKKK\DPII 
CCL/QKTHFTG/KDIYRLKIKGWKKIFH 
ANGSQ*Q 


9657 


23558 


A 


9728 


368 


380 


RKGQR I S E I AI KRLNKMAR I S PLIS I I I 
LNVSGLNFPLKRYRRAEWTEKKK\DPI I 
CCL/ QKTHFTG/KDI YRLKI KGWKKI FH 
TNGSQ*RR 


9658 


23559 


A 


9729 


294 


2 


KGNLSPKKKKNLKNFKVTPQNFGLKDQK 
TQGKPTLKVSQNQKKPKFK*NLKGKRPIi 
KL F P KKKKLNP S RS WF YE KKKK\ IXSTKLD 
GTLVRQ INKKRKNQ 


9659 


23560 


A 


9730 


7 


284 


SQGLGRLRRENYL* FETESHSVARTGVQ 
WRNLSSLQSPPPG\SSNSPASASQVPVI 
T 


9660 


23561 


A 


9731 


1 


322 


RSLKKKKRREKKGKNQPGQHGKTLFLPK 
TPKI*KIKKIYPG/RGGTHLNSQLLRNL 
RKENHLNSGGKGCNDPKLGHCIPAQMTK 
TPFKKKGGAKQNGKKLLQLIYGIK 


9661 


23562 


A 


9732 


168 


2 


GFYRKSLSCLSVWICFKDFFFFFEMVSR 
S VA* AG VQWHDLGSMRP P P PG \ SGD CP A 


9662 


23563 


A 


9733 


43 


317 


LKLQPRDTHCIWLSTTSPFIYFFFETES 
CSVAQA*MQWHDLGSLQPPSPG\SGDSP 
AS AS CTGMHHHTWL.MKCINS FDVPDMP L 
GLGLNSHI 


9663 


23564 


A 


9734 


8 


296 


NWSAGSLjNGLVKGRLSCLTLNQIL*MI 
TLTTERFLKAERGQKLGLLHQTA/QWN 
/ AKE I FLNE VS S ATP VYE G 1 1 RKLNSIi I 
ATMEKACWWIEDQT 


9664 


23565 


A 


9735 


3 


300 


TRPSNS PAALSKCS SRRKT/R / TCLTLN 
QKLEMIKLSKEGILKAERGQKLGLLHQT 
A/QWN/AKEKFLKEVRSATPVYEGI IR 
KQNSL I ANME KA* WW IEDQ 


9665 


23566 


A 


9736 


352 


1 


PFF YLLNFPTFFKGPPL IHFFLESFLEN 
FGFQGAFFKLQNFFAFLFSLLNGPFLEN 
PFYFKFPINC\KFLGQNTFYCPYLGFF* 
RSFFFFFFFFETESRPVA\RLECSGAIS 
AHCKLG 


9666 


23567 


A 


9737 


130 


3 


KIFFFFFFFFLRQSLSVAQAGVQWC/N* 
LGSLQPPPAGLKRFLV 


9667 


23568 


A 


9738 


167 


3 


ISRSVFCFFFETESCSVTQV* EQWRNIG 
SRRPQPP\GSSNYCASACRVARFTGLV 


9668 


23569 


A 


9739 


140 


419 


INVKYLSRHFFKGNIQSPNKHRKAIREI 
QIS\*TD3SnSTKCW*GCGEVQYFIHCCMCE 
MVHSLWKIVW*LLTGL/HIELSDDPAIL 
LGGIPPEEMKT 


9669 


23570 


A 


9740 


93 


371 


KLCNGYYQESKREPTERN* IFANHKTKG 
CISKTYKELQLNNKMS/NPTQKWAKDLD 
ISPQTMKQTAKNQVKRCSTSLVIREMQI 
YTKSIPLHIH 


9670 


23571 


A 


9741 


1 


288 


GTRDHLRSGVRDQ\PGQHGETRSLLKIQ 
N* GCGHGGS CLS S Q \ LTWRLRQVNCLS 


9671 


23572 


A 


9742 


296 


563 


FIKHLNS YMLGQNIoATKLSE I CKLPFYI 
EK* IQAGYGGTCL* SQLLRRLRHDNCLN 
LGGGGCSEPKI\HCCTPAWATDGGSVSK 
KKKSRR 


9672 


23573 


A 


9743 


32 


400 


DALVPHWS SYS P I CLDLNVLGI YLFSDR 
VSLLVPRLECSDAI MAQHS LDLPRLR* S 
SHLSLPRNWLMFVILVETGFCHVA*DGR 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KLLGS SNP PALA/ FLKC/ WRNMRKSQL * 
PLLLGLLS ADHS D 


9673 


23574 


A 


9744 


60 


356 


HSEVDS I I IPILQGRKQISEVRPQRMCE 
REGGLFISETLLFLCPAPSCS/HFLTPA 
Q*SYF* KNVSWLGAVGDTCNPSTLGGRG 
GWI\RGQEIKTILANMV 


9674 


23575 


A 


9745 


2 


446 


PMVNKHVKRCSASLVIRAEQIKTIMRSH 
CTPNS * M * KTDNTKS W/ *GCGATGTP I H 
C/WMKYKMVQPLWKKAQQFL\KHIHLPY 
DPINLLLGIS*EK*1SJHLC/YKKTYVRIF 
IAALF 


9675 


23576 


A 


9746 


257 


1 


WLRLSFPSFLLPILSPSSQGDLVCCFL 
MVF*RDGVSLPIPAGVQWLDRSSL*P/P 
S PGS S S P PAS AS * VAGTTGMHLHAEP YA 
SC 


9676 


23577 


A 


9747 


148 


3 


IKLCEQFHK*TIHVLNHK/L/WPGMVAH 
AYNPSTLGGRGEWIT*GQEFDSC 


9677 


23578 


A 


9748 


769 


1 


YPQLFKIAKKKKKSNVPIKSIMVSQYGH 
* ENKKW VLNHKGATLQ/EGKGNFGDTFK 
VTL KD K I AVAVKTRQERL P QE L KLKFLIi 
E AKI LKQ YNHP SNVKS IGVHTQRRP I YV 
I KELVPGGDFLS FQRKKNELKL \ VKFS L 
DADSGWC I SSKKC IHRDLAVRNCLi/ VGE 
NNVLKI SDFGMSRQEDGGVYSS SDLKQ I 
P I KWTAPEALNYGRYHSESDARS FGILL 
WETFGLG\ VCP * P GMTNQQ P Q E Q VERRY 
WMSVSWQC 


9678 


23579 


A 


9749 


607 


986 


SFSSSSSPEFISYGCDLLPMRDPHVFIT 
HPAFNLPLFLALGMFFSLFSPSPLFIYF 
LPPPPRPPC*HSPP\PPPPLTNSLFVFL 
PPPPGVPSSPPNTSPPHLPVPRPPPSPP 
TSTHLSSPPPPSPTH 


9679 


23580 


A 


9750 


313 


339 


AR*GLS CSG* S * TPGFMQS S CLNLPKCW 
DYRC/RATVPSLCFVF 


9680 


23581 


A 


9751 


213 


453 


NLAVNVDE INLFFFPLFLIVTLASKFCS 
/HLLVGR * FIQS CHYFLFF I FDTESRS V 
T\RAEMSATVLAHYNLCLPGSPDSSA 


9681 


23582 


A 


9752 


368 


3 


PLSSPRLPFPLFDPSYIFLISISSIRFL 
SPPCYFLCVFLSISWGFSLVSQDGSDLL 
S S * S AFLGLPKCWDYRL * P PHPAWRELS 
FFFS/SFFYLR*GSLSPSLECSGMIIGH 
CSLDILVSSARA 


9682 


23583 


A 


9753 


2 


335 


ARAGFSGGDWNVGMQSTASGLSLFHCFI 
S*GELFIILYILLKCYITL*IN*LMFGS 
FYFRDLQHFRLP / TS I VGKKRPG T VAHA 
YNPNTLRGRGGWITQGQEFETTLANMVK 


9683 


23584 


A 


9754 


467 


120 


SQLFGRPRQTDHLKSGV*DQPG\QHCET 
PSPLKIQKLiAGRGWV/ CAPIGPRLLGRP 
R * ENGLNSGGE VCS E PRSRHCTPAWVRE 
RDCLKNQNKTKS G VliFHSPHQ CFL X SNT 
SRNLL 


9684 


23585 


A 


9755 


137 


368 


DGVYL WTHRP YCGLGS LiNFGS VI I VLP * 
VKAYGWMVIjTS LGDAI Q P LAD P E C S F \ G 
QLRELRCDMLHVDPEDFRLLGK 


9685 


23586 


A 


9756 


172 


267 


SKSGNKPNDYHLMNGSVRWCIRTV*QLL 
KRLTTELPYGPAVAFLGVYAREMKAYVH 
MKT YS LQKMCT * I F I AALFG I VKKWK* A 
K*LSPDEWISK/M/WCIRTVEYYKriRE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


96S6 


23587 


A 


9757 


56 


318 


QFWGFSFFFFFFCFIDTETCSVSQAGVQ 
WHNQSWLTA\P*SDPPAS\AS*VAETIG 
MCHHDQLIFFFFKQALCPRGWSAMAQSQ 
LTSSS 


9687 


23588 


A 


9758 


179 


1 


KDTDRVSRGNTQDTLKPPPPG \SSDS PT 
SAS * VARI IGVHHNTWLI FC ILVEMRFH 
PRA 


9688 


23589 


A 


9759 


219 


208 


NFCNFQMPLYQKILFWVRVSLLSPRLEC 
SGTIITYCSL/RTPKLK*SS*LSLGVQG 
CSKL 


9689 


23590 


A 


9760 


133 


371 


AMAYQLYRNTTLGNSLQESLDELIQ\LR 
NR VNFRGS LNT YR FCDNVRT F VLND VE F 
REVTELI * VDKVKI VSIi\DGKQTGF 


9690 


23591 


A 


9761 


813 


3 


CGEGWAAGQDQGSRMGSCLPVPRAGRQL 
GSLFPAFQVYCSSCGTQRSSQSQWHKTE 
RSLPGNNPAPQDS /ASAPKYQTPGP VLG 
VQS PLNQQS CHTALSPEAS S S AFFHTTP 
ATLLHQARTLSLTQGSRPTTWPVLLKMP 
A*RPRESRH\GR* EWTKRSQARSQEAIN 
ARPRKALGIiSDPAPSLHPGSLEDSPKFS 
LGAPKAPPLPASSPTAQETQQSHISQQP 
TATGFLGARLCPSPQHICQRGGFPGVTP 
LLLIiGEWGVGEHS FPDVLV 


9691 


23592 


A 


9762 


96 


377 


RYHTNMAAQ IPESDQI KQTGFHHVGQAG 
LELLTS\FKEFLGTYNKLTET*FLDCAK 
DFTTRERKSEETTLSEHCIiPKYSTMT\H 
RISLT\FTDNHI 


9692 


23593 


A 


9763 


164 


399 


TDEELLLRDEQ /RKWFLKI ESTAGADAV 
NIVEMTTGDLE*CINLVDKVAARLERTE 
TNFE /RCSTVRQ/MLSKS IACCTE 


9693 


23594 


A 


9764 


225 


25 


KCKITMNST* IE/ TVFKDYYEHLYVCKL 
KNLE KTDKVLT IHNFLRLNQ * E PEVLTR 
AKMYKIYNDIE 


9694 


23595 


A 


9765 


128 


218 


FFKDLPDCFPKWSHCFTLLLAVYEGFTF 
ST* LVIREMQ I KTP \ I * YHFS PNRMV IM 
KKS N I NKCWQGRG E S KAL I H C * *ECKTV 
* P LW KT VWQ I LKKL 1 1 G 


9695 


23596 


A 


9766 


194 


551 


KNFFFLEME F / S VLLPRLECNGVISAHR 
RLRL P LS S YS PAS S S QVAGD YRACTTTA 
G*ILYF* *RQGFHHVGQAGLELPTSGDP 
PAS AS QSAG I TGVSHCPQLKKS I LHETP 
KGLTGVTS 


9696 


23597 


A 


9767 


238 


3 


SFLWKLCPRGAPACLRCLSA\LLGGVS * 
SGYMGVRDPLEEAVCPFSELEHHAERTT 
ALFRAVPQGCLSLQKIiSAAFCSC 


9697 


23598 


A 


9768 


343 


1 


RGAHHSRGDCVRFTGFCAP I PVLNHPKP 
IiFP FF V* KS P I ERS S LLG P PLKRS KNRF 
PFLR/VPRFFKNHPEFFFKKIFLFPPFF 
FFFEMESHSLA\RLESHGAILAHCNLSL 
PGSC 


9698 


23599 


A 


9769 


191 


2 


FR I NH * TS H / LRGQ KKKKKTNS KAI GRK 
QITKIRA*LNETQPPESIQRINET*S*F 
SGKIKLI 


9699 


23600 


A 


9770 


165 


2 


GKP PNC FY I LNFKKKKGQD VFNRT / WLG 
WAHACNAS TLGS QGGW I T * AQE CKTS 


9700 


23601 


A 


9771 


97 


912 


VILSTGCSSGPLPGWQSVLHHSLTRCPF 
S FFLS S PTPEME / FSLLLTRLECNGE I S 
AHCNLLLLGS SNS PAS AS G VAGNTG I / C 
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amino acid 

residue of 

peptide 
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Predict- 
ed end 
nucle- 
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location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoJeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /=possibie 
nucleotide deletion, \=possible 
nucleotide insertion 














GLHAC* AS * FLYFHSVETGFHHVGSGLG 
LELLTSDD\RPASASQGAGITGVTPRV* 
PLNSFSRHGLSCFFKETRVLSCCPSWT* 
TPGLK* SSCLNS * E / C * GYRHTPAHPAQ 
DMVFI FITY/ CPFFVFPLL / VYI PRFLF 
LRWSFTLIVQAGVQWRNLGSLQPPPG\H 
KQSETPSQI*INKY*LLN*LITFFFSIK 


9701 


23602 


A 


9772 


3 


363 


HELRS SANEHVFAS FIGPT I LGLPGVLL 
IILYPPLLIRTS IYLISN\RL\VTTQQ* 
LIKLTSKLMITIHNSIGRS*SLILGSLI 
I X I ATTNDDGLL P YS FTPTTQLS INLAM 
AIPL*AGAV 


9702 


23603 - 


A 


9773 


38 


385 


YLIIIiDS PEKGL I CGCLCNCMHS SLLDH 
LNLT * S S * MCLYANTHI Y VAI S C I YF I * " 
F*SI/YIFII*YFHYSIAYVYIY/IYIY 
MYTHAHIHMCI 


9703 


23604 


A 


9774 


13 


1350 


DRVS LLLLRLE CNGS I S AHRNLGLLGS S 
DYPASASQVAGTIGVHHHTRPTFALFLV 
ETGFHHVGQAGLELPTSGDPPPL\ASQS 
AGITGVSHCTWPHLSTTGKILSSGTICP 
GI WGQRWWAFL I L \ S LP WL * LWHPCLS S 
IMCLSLFFFLCQ/RCKPLISD 


9704 


23605 


A 


9775 


376 


1 


KYFELMYI PVIAI CLFNSEVFLNK*QFF 
NLTSSSG I QNLNRL I ISNEVES I I KSLP 
TVKNLGPDGIAATF\TYKEQLTLILKLF 
QKHEEARILSNLTSETSITLILRKQQPK 
KVPNEHTVKILV 


9705 


23606 


A 


9776 


192 


2 


NVYQPPPVFLYP PGVAGGV* PF * S PP VF 
RVK/LGF*GSLFFFFFFFETKSRSVTRL 
ECSGVILiA 


9706 


23607 


A 


9777 


77 


350 


WLR KFGS RNS GKLTIa YLNVY YLQ T S F F F 
FFFKIGV*LSPTLECRGPIWW*NLCFP 
G*RDFPTLTS*VL\GPRVILEFGFLEKT 
GFSHVPQIi 


9707 ' 


23608 


A 


9778 


133 


325 


P RW VR F P YLGD C T PR VS EMLGLQG WP S 
SSPFFVCFET*SCSVAQAGVQ/WWCDLG 
SIjQPLPLG 


9708 


23609 


A 


9779 


145 


362 


RLAFGSCSTCFPLT*TVLYYLI * FMIFF 
VFEMES LS VS \RLE CSGAI SAHCNLCT P 
AWTTERVDAAANSRSMN 


9709 


23610 


A 


9780 


260 


1 


Q KQNQ E KKRKK I FRN I KHQRD YNE Q * YG 
/NKLDNPEDMNKF*ETRYT*/PNLHQEE 
I GMLNRP VTNKVL * LVI KIPLTKKS PE P 
DGFV 


9710 


23611 


A 


9781 


230 


1 


SSDSKTGSSWLVACRFFFFLETGSGSV 
TQAEVQWHDHSSLQPPPPQPPK*/PGTT 
GA* STLAS * NAE I TGVSHHAR 


9711 


23612 


A 


9782 


20 


322 


SQHFGNPKGNPKIFRNLEINFFRLRNAD 
HMIISTDAQKVFDKIQHLFII*TS*QT* 
TERIFLNIiIKAS*KKPTANIILNG*RLN 
I\FPKTGETKHIC 


9712 


23613 


A 


9783 


158 


379 


LVKIICNSFQYISFFFFFFLETKFCFFP 
Q VE VQGGNFG * LNL P L P GLNHFS \ AS AS 
REPEIPGPPHHPG* ILVF 


9713 


23614 


A 


97S5 


54 


312 


KVDKS TK IGRNQ S KMAKNS KNQNAS S PP 
KDHNSSPARKQNWTENELDELTEVGFRR 
WVITNSSKLKEHV*PH/GKETKNLEKRL 
DQ 
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NO: of 
nucleotide 
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SEQ ID 
NO: of 
peptide 
sequence 


M 
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26 


Predicted 

beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown ? *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9714 


23615 


A 


9786 


13 


613 


PGAG WARQHHGV * DQ P GQHG \ KRP S LL K 
IEKLAGHGGRCLLS PAIGGGLKQGESLE 
TGGRDCSE PRLCHC \MPAWGNRNKTLS Q 
/ QNKTR 


9715 


23616 


A 


9787 


794 


164 


FSFLFCFFFLRWSLTLSPSLECSCVTSP 
HCNLQLPGLSHS PATVS *VAGT I GTCHY 
TKLS*FF\FDDSL*APFLGWDTETYYP 
DPPS S KGIiLPHSGQHGQQHP PAI SRAWL 
SC*VLPHI 1 RS/STSLGQP/RIW*LSKVE 
V*RGFTFS PTQNTPTNNNCS I APGRVD * 
SLLID*LQWCDHSSLjQPQTPVIiK\HPPT 
LASQSAGITGISHHTRPD 


9716 


23617 


A 


9788 


256 


380 


YI FRQAKTQGL P P CALFRE AVGENTP * V 
FAC/PKNVLFYVHP*LFG* I * IYQLKIT 
F*NYEGMVT/FVFRFLFETRSHLSPRIiE 
CSGT I TTHCSLELVGS TNS S I S AS * VDG 
NTGVRHNASC 


9717 


23618 


A 


9789 


22 


226 


TKINSKGNKELNVRAKTIKLLEENIDRN 
LCDLGLGNCFLD*YCIL*TKINSKGNKE 
LNVRAKT I KLLE EN I DRNLCDLGLGNCF 
LDMT PNAQ TTKGR \ IDKLD F I KI * NFWI> 
DT 


9718 


23619 


A 


9790 


288 


2 


QVWQE * AE TES AIHCWWERKAI QLL * KM 
RWQFLKMTNV\ELP YGSAI PL.VGIHGRE 
LER*ST*NIMFTAPLFVIGK/T*NLFKW 
PSPYEQITKMHSC 


9719 


23620 


A 


9791 


142 


361 


P PGEGKRVPFTTE P S PRI P S KAKSVSQG 
DTCTPMFTAALFTIARMWNRSKCPSFEE 
* I KKMR C I TT /ME C YS A 


9720 


23621 


A 


9792 


305 


346 


TYEYTDYGGL.IL* *liITFYDHAVLiIIFIi 
ICFLFL/YALFLTLTTELTKTTGGDAQE 
IETV* TILPGI ILGLIALPSLRRIiYITD 
DAPDAS LTIKSI GHQ W Y* T YE YTD YGGL 
ILNS 


9721 


23622 


A 


9793 


2 


317 


SRDRPRVRDR*LFSTNHKDIGTRYLLFF 
A*AGVLGTALSLLIRAELGQPGNLLGND 
HI YYVI VTADAFVI I FFIDLP I L I \SS F 
G\SVYVLLILGVPCTAVFLHSS 


9722 


23623 


A 


9794 


384 


1182 


RIGKI KG/ LCFLFV* WFLKFFFKMEFLL 
PRLECNGKI\HCNLLLMGSSNSPTSASQ 
VAGI TGMC * F\VFL IEMRF FHVGQAGLK 
LLTLGD/PPRPPKVLGI 


9723 


23624 


A 


9795 


205 


1 


GLQIKCTMRYHFAPTKMAITYFFLKKGK 
G\NNKCW*GS/GTEIGILIHCWWECGMV 
QLLWKTADAWADAW 


9724 


23625 


A 


9796 


307 


348 


QSARL*EAVCPFSDLQLRTGRTTALFKA 
VRQGHLSLQRLLLS F\VCLYPAPRGGAY 
RGRQASLS CGGLHPVRASRLLCLPKQAW 
AMVGAPTPASLPPCSSrSHCCASNQRDS 
VGYDPSSP 


9725 


23626 


A 


9797 


193 


381 


ILLI*IHILLSMISSPFFFETGSHSS/V 
SRLECSGTASAHCSLDIiPGSGGSPTSAF 
*VAGTTGA 


9726 


23627 


A 


9799 


153 


359 


FGTYMCFADSYQLSYL.FWGTFSWLEEE 
GV*WCICGSLQPQPPRVK*/FLPSLSLL 
SSWDYRCAPSC 


9727 


23628 


A 


9800 


2 


396 


ARAARAARELEELIKIFFFFFFFWGQIL 
ALMPKGGGQGGILTYPNPPLPG*NHFPG 
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26 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G= Glycine, H=Histidine, I— Isoleucine, 
K~L,ysine, L=Leucine 7 M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














LTPPRTGINGLGPPGRVNFGIFKKKGGF 
PPGARGV*NPGPRGTPPPGFPKGR\DKA 
PHPGPL*TFEKVPRPPNFPQ 


9728 


23629 


A 


9801 


256 


347 


Li Y I LLLNS LiK * L VE KLR * NWLAS L 1 1 R 
YKVTTKRF* *FSKPAQSEELQLLTSAPG 
LNF*RYIISFFFWIQSFALVA*VGVQWC 
DQGSPQPLPPGFKRFS/ CRSWDYRHEPP 
R 


9729 


23630 


A 


9802 


109 


401 


HLRRP CPTPPFALRTSGDQPLS PEARPD 
S GLLHS P LTS PRGLNGQ C P P RGS TLN FN 
P*ASIPASPFPSGQPQASQEPPPVASEH 
PPDT * GDPAQPPP \RPQDFWGPATVPRS 
QAGQWPSPLPSDFSKGAQWPVPPTR/PP 
PSTSTQSRWT 


9730 


23631 


A 


9803 


371 


2 


PGFPLFSFPEGNGPSKRQTGD* IRCLF* 
DGKVWE CS PPKKKKKTKKRKKAVI F / CV 
PVQTKCI WEGGEETLiVGDV* V* P* \GS 
FKHWAMFPEK/ DCLCTLYEAS FKTKES 
RRVDG F VC VRVGT 


9731 


23632 


A 


9804 


188 


1 


FLFFFFSET*SQSVTHAGVQGLEQSLPP 
G\SGNSRASASQVAGITSRRKHSWIIFV 
FFVETG 


9732 


23633 


A 


9805 


112 


383 


VFINIRVFRLFTFSFFFKRGFNLiSPRVK 
CNG*TKGHCIPDLPGVR*SFPFSLLTNW 
DYRCAPPRPNKFFF/ SFETGSHSVTQA 


9733 


23634 


A 


9806 


599 


237 


FRDRS LTLLPRLHCHCVI TAHCS LKLLG 
SSELPQASPLSSWDYRHSPPCLANFLFF 
VETRSHY/ASRNSLGSSNPT*A/ FPKCW 
DWQ V * AT APS LAY W F S E AE KL I LRL LAL 
KVSDENLCTNF 


9734 


23635 


A 


9807 


168 


2 


PQQDVFFFFLETGSYSVTKAEVQWCDHG 
SVQL* PPG\STDPPTSASQEATGTHHRA 


9735 


23636 


A 


9808 


225 


3 


GPLGYLPPYFPQPL,ISRG*NPSDFLKIS 
RVF YGT / HYKFE FL FLFFFETE S RS VA\ 
RLECSGVISAHCKLCLPGS 


9736 


23637 


A 


9809 


541 


3 


RQLTGINSGRRQFQPQMSNYVI PYMMDM 
IILKTKKSNNKYWQGCEKTELLIHCLWE 
CKMVQLVWKTV* QFLNRSD I EFPHDPG I 
PPL/GYKRKKMKTCLPKKLCVLVEP/RM 
FTAALFEVAK* *KQ/ PQTPITR* INTMW 
RIYTMEYYSAIPRKTPLIHPIA*LDTES 
II*TEKIQSQETTYY 


9737 


23638 


A 


9810 


213 


1 


FLCVDFVPCNFAELKLALVGLFVCLLFG 
SVTQAGVQWCNL\GSL*PPPPGLSDLPT 
SAF * VAGT TGVSHH 


9738 


23639 


A 


9811 


54 


388 


PARPLPRQ/WDERPNQQPTKKKKKRGRN 
LPTKKKTQNYSHQNPVAQRNPGGKTNQK 
KPKTNPKKKKGGAL * KKTQRGPKQTGGR 
K*KISPNKGGK*NPCGKLLEKNPFSGGE 
K 


9739 


23640 


A 


9812 


429 


24 


RSKGAEFPICPCRSSKFSVLQQRQVRIV 
FQHPELGES VAR/ CRYSQLLGRLRQ* SH 
LNWGGRGS SEPRS CHCI PAWATRASSI F 
CNFQAS S VEVRRS ARKKLFSD I L.KRHNT 
INWRVSGLLLVDSYFGRLATPVRTQ 


9740 


23641 


A 


9813 


149 


350 


CHFP C FHTTL/ CE * EPGEDVTGP I GTKF 
HLCVWNTLiIHFV 
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nucleotide 
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ing to first 
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residue of 
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Predict- 
ed end 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9741 


23642 


A 


9814 


66 


401 


PENKKT VYRKP I ATI I FNCERLNALPLK 
LETRP*YLLY*LFNIMLEVLASWRQEK 
EINLSIFT*DIVIK/MKIPKNLENKLLK 
LLSPFSKITGYKISIQKLTSSFLYYGFT 
V 


9742 


23643 


A 


9815 


82 


1 


APPPARFFFFFFF*EKFSSFPPGEVQRG 
VFCPCNT/CFP*G*KIFFL\KPL,KKVGF 
WPPPPGPKNFFYF*KKGGFPFWARGFFI 
PTPFVPPLHPFKKWGVKRG*APPPARFF 

FFFFFL*GRVSLCHPGWNA 


9743 


23644 


A 


9816 


256 


2 


QHFP*LPIPHIPIQT*QHFPQTRSPFQG 
PPLCQKPFSSPWGSKWFILLSFFFFF* 
DRVSLCHPGWNAVM*SQLTAASNTW\VK 
* S SHLS LLSS* DYRCVLS 


9744 


23645 


A 


9817 


53 


411 


TIYCSNYVYNFVSQHFRSLYS*KNCGLC 
L* FHLF VLQS *NFMQFHM * FCDKI * VDP 
I ITFEV* FTYI * FF*TYSS VS FTDAYNF 
VKPHR/ VYRGPGAVAH I CNP S TLGGQGG 
WIT*GQEFK 


9745 


23646 


A 


9818 


100 


417 


DEHSEKQDEPLQRPRSKLLPFPGTPPSS 
IjLLLNHIGQESGLTHVRLASCLRNRGFC 
FDMKERLCLLPMLECRGMISDHCSLI/R 
PG * DKP P AS AYQ VGETTGT YHRA 


9746 


23647 


A 


9819 


136 


2 


IYMGMQPGAVAHACNPSTLGGQGG/WWI 
T *GQE FKTRLANMVKP C 


9747 


23648 


A 


9820 


257 


426 


IVPGLFLGTGSCSFTHAGVQ*SNCSSVQ 
S *TPGP \ S D P P AS ASRVAGT TGAHHKAW 
L 


9748 


23649 


A 


9821 


151 


416 


LHKLCLMVME S GTAKT P S LS P I FNS VGL 
ILLLLEYSIDSLC* SVLCCYKEIPEAG* 
FI KKRGL / WLG WAHACHLS TLGGRGGW 
ITRSGN 


9749 


23650 


A 


9822 


50 


513 


RGDPRVRPRVRI IKLSEEGMWKAEIGRK 
LDLLHHT I S QWNS KE KFL KEM KS AT S V 
NTRMIRE * NSL I ADMEKVLW* DQTSHN 
IFLVQNIIKSKGIGDFKLMRAKRDEES/ 
AEGKLDASKGRFVRLK/ EKRSHLHNMKV 
QDEASVDIETTASYLKV 


9750 


23651 


A 


9823 


48 


406 


LIIWYTLTTRWIGHSAVSYPLLRPPYS 
LRHNIEIRPINNPTMASKCSSERKSLTS 
LT FHL K I KM I KL S E EGVS KAKTG * KLGL 
li/R/QTVSQWNAMKKYLKEIKSATPVN 
T*MIRKQNNL 


9751 


23652 


A 


9824 


143 


444 


WNKKNNKNELADM I LWGCGDKT I FMRKL 
TRDETKYGI PQLALRGHSHFVRDGVI \N 
LDGQVDLS\G*WDGTIiHLWDLTSGSTTR 
RFVNK/TKDVLSADFS 


9752 


23653 


A 


9825 


297 


2 


HRSPTKETKIIAS IKKKTMITKDLNRQF 
S KQDKHLTSE YMKRCPTS / LREM* AKT I 
MRYHLTP IRRAP 1 KNKNK/ NKS KCW * GC 
GETE I LVHCW* ECKMVP 


9753 


23654 


A 


9826 


175 


400 


GGKGGKNFSLKGGEKKTNLGI FGKKP I F 
GGGTNGANPPPKIKGSKEKKNF*VFFPP 
FPFKNFFFP*NL*FLGGGWPHLSPPQK* 
V/CFPKIPKLVFFSPPLREKFFPPLPP* 
NLGPPGF/ SFWGPPLFFFFFFFF*RDKF 
SFYHPGWKAVIAAHFSLELLGSSNPPTS 


9754 


23655 


A 


9827 


64 


622 


MFSFFLFDLILNHIjFFCFV*MFSFFLFD 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrostne, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LILNHLFFCFVIRQQFSFLFF/ ISFFFS 
FFH CNVNT PE L CD F S VC I HS DNS FRFLH 
LF/NF*HMTLCISELR*TLKQSCYSYTK 

IC**FYSVQLSFLTILY**CKS*QALSI 
ALQNE FLS V\QYHKNMNYI TLT I EFSRN 
KFLS VTFLLFS I FSHNFGI SAS ISIFIL 
HVYIKNNPIGFFK 


9755 


23656 


A 


9828 


298 


1 


MRNCLTDDERKWFPEMASTPGEDAVNTV 
EMTAKDLE/Y* * YINLVDKAASDFEMID 
SNFERCSTV/NKMLSISfSTACYRENFHER 
*GQSVQQTFVRVGERV 


9756 


23657 


A 


9829 


361 


1 


FLTGTQ WGK/DS PFNTWC* DH/ WNIHRN 
LDLYL I P YIKINLKQLTGPNLRAKTI KL 
PEQNIGENLCDL*IiSRERYSTKSITRIR 
KL/DTLGFIKIKNKCISKDTIRKR*ATD 
WEKIFANHVLKG 


9757 


23658 


A 


9830 


402 


2 


RFHHRFS FVLFGPFAKVAFP IGAKNF I S 
NHYWAGVFPFWEKKNKIDSFLPPFSKNP 
ILGELKA* F*NFLF*GFQ JFF*SLFFCP 
FGGK* IPGVFSFFFFFLVFFEIGSHSVP 
\RLKCSGTITAHCSLDLLGSSN 


9758 


23659 


A 


9831 


2749 


3215 


FCQ*IKMYVMCICIYNFRRVYLCIYTYL 
HTHT FTHTHKHTP TQ I P E KD S/QCSLSD 
LKGHSL 


9759 


23660 


A 


9832 


3 


386 


KLRLGQLNTSAMAAKC* SERKS / R/ THL 
TLNQKLEMIKLSEEGMLKAVIGQKLGLL 
YQ/TSQWNAKEELIiKDIKSATPVTA*M 
IGKQNSLIVDLEKV*W*IEDQ/TSHNI 
SLSQSLIQN 


9760 


23661 


A 


9833 


1 


370 


RRCRWPPDPSRTVGRQIGKLVTHRPTVF 
Q ERG C P F P LTRQAGS HHGGGAFAQ V ISP 
TKS I S PCGRGGSRL * SQHFGRPMQVDHL 
R/ LGVQDQ / RWP I CGQ YGE TPA\ LLKL P 
KISWAWWLAPVI PA 


9761 


23662 


A 


9834 


125 


409 


GQENRETWKIVHLV*VLYT*HIKRLNCF 
SHFI * CYQPTASQAHVHHDSNDSSTHV\ 
N*NSRWPGTVAHACNPSILEG*GGWIT* 
GQEFETSLANM 


9762 


23663 


A 


9835 


223 


1 


PKPQKMGTF*PPMEGS*MGPPHWSKMGP 
FLKKKEGHPFFFFFFEMESRSVT\RLEC 
SGA/T I S AHCNLCLPGSSNS 


9763 


23664 


A 


9836 


377 


515 


FILFLRQVLTLLPRLEYSGAIVAHCSLX 
AF*G*SDPPASASRV 


9764 


23665 


A 


9837 


274 


1 


AGEWHDQ I C VFKI PLMKNGLHGARVGQE 
SWKERNQQV*DRCQKQNLSDS*IQT*R/ 
M/WPGTVAHAYN/PSTLGGQGEWIT*GQ 
EFETSLANMVK 


9765 


23666 


A 


9838 


553 


114 


GAGVKTHPGHKGKTLFFFKFFFQKLPGV 
GGKTLPSPLSRGLGRENSFHPGGKGSNK 
QSSPPSFPG/WGKKGGGLPFQKKKKKEK 
RKKGKKNGVAPSE P PLKYSN/ WQATWG * 
RKLLND TMR VQP TKS RMQ K I LQD KGP VN 
GIFTKTGRIi 


9766 


23667 


A 


9839 


36 


434 


LPFPQCVTEFIIVLISWCYIREDACKNL 
KHVAI T I INVCI Y I C VCVCPS TL YVYMY 
MLP/HLSDTLLTLN*VTLVEMLSSFQCL 
MFWYHGHI *K*HELDWAHACNPSTMGG 
QGGWIT*CQEYKTSLANMVKPR 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 


9767 


23668 


A 


9840 


268 


1 


RFPPFFCFLTILFWAPGWFSNFLGVRA 
PQPP*IFFPI*KKPSPLIFFFFFFEKES 
RSVS\RLEYSGTISAHCKLCVSGSQHSP 
AS AS 


9768 


23669 


A 


9841 


463 


86 


LYMHEFISGLFLLFCWSI/WSVLCQYKN 
C FDDKVYGLDYGNTS Q VFT * NS KL I KL Y 
TLTMYSFLCINQTTIKPSEMIQATKANI 
DK*NYMKLENFCTAKSTVNKMKRQLKKW 
EKVFSIHIPQEVNI 


9769 


23670 


A 


9842 


37 


400 


VHSPPFKFWSLYVSVLKFPFCSLYFKFL 
ALYS I FLQMCQHLLV* AFFF * KRGLNSV 
T \ RL E CRGAI S PNCNL CL / S GS S DNWAS 
LPRKAEAGNSFT / CR* SQVAGNP I PS I S 
TRLNQRKTPFES 


9770 


23671 


A 


9843 


42 


407 


NCLSFLITFFFFFLFENKVSFCPQG*G/ 
RGGPFWPHGTLPPRG*GNNPP*L>PGRGE 
* RGAP P P PG YF WL FGKKRG F PL \ GQGG S 
KP PALKE PP PLGP P KGG\NYKRNP PPP P 
GNFF*LPPYQW 


9771 


23672 


A 


9844 


113 


379 


LGPWATSLiRGRFLGYQLQQSKKH*NYP 
TVR*GCGEMRTSVHCW*ECKTA*PL*KT 
VWQ FLKKLNTGL P FDAE TP L / LAE T P KE 
LKA 


9772 


23673 


A 


9845 


284 


528 


PLiPKTMEIMLDKKQIQTIFLFKFKMGHK 
I AETTRNX KNAFGPGTANE \ KCTAQWWF 
KKFCKA* E \ SLEDKE A\ *GHPSE VDTN 


9773 


23674 


A 


9846 


888 


1295 


LEQGCNLFLHKHS *LGEVFFFVCFRRS F 
TLVAQAGVKWRDLGS L / HKLPS SLSLPS 
S WDYRP PLPRLA/ NFFVFLVEMGF \ TVL 
ARMVS IS* PRDPPTSASQSAGI IGVSHR 
AGPVAG I LMFCRWNI SNKPKGAVFKKK 


9774 


23675 


A 


9847 


88 


387 


AYRMKI IDRISLSLYTAALWVTLFLFLF 
LFFFFKTEFGFAPQAGGQ/WGQFKLMDP 
NPPPPG*K\DFLVSHPRDLGIKGAPPQC 
GQNFVFYKEKGGWLLTAT 


9775 


23676 


A 


9848 


298 


438 


KI PRGAPNFPGGGKI FS PPLGGP I KTPR 
GALEKKTLF*VRGPWPGFVKKESPRPKK 
LG*QIFWPPGKKPNPPMGG/SLDPPPPL 
ILSRPDPP* KKKKKKKKKKKGGG P L KNP 
PGGPKFPRGGKNFFPPFRGAYKNTPGGS 
*EKNPFLGGGKKRKQP 


9776 


23677 


A 


9849 


251 


498 


All KKME S KKFW * RHGE I GTL I HC Y WE F 
KMV* P F WKT V* Q FLK/ D /LNMGL P FDS A 
FQLPD I CLGELKTYVHTENCTQMFMAAL 
F 


9777 


23678 


A 


9850 


478 


37 


E S RNKHS HLW S INFQ KGLR * LNRGKQ PL 
NK* CRHNS I S TCKRMKLDPYLTP YI KI \ 
KS KWTKDLNVRAKT IKSLEENI EWLHD 
LG * GND FLDMTPKAQTTK*NI D * LD I IK 
I *NFCGSKDTINKVRRQSTE*EKI FANH 
VADKLK 


9778 


23679 


A 


9851 


2 


378 


RLEGLFLCALFCS I AI CMFFFFFFFF*K 
KRGPFGFQGGDPGGNFRQLDPLPPGIKQ 
FWGP \ P PRKAGKTGGGP P PGANLGF WGK 
KRVPHGGQKGFKP/RNPRGSPRPGPPKG 
GVAFVFRPKP PGLEQ 


9779 


23680 


A 


9852 


229 


3 


FGPYKI FFKKKGAPCPLENPPVF \HQKL 
GLGFFFFSFFFF*DGVSLFLLRLKCSAA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T-Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 














ILAHYKFLLPG* SDSAASAS 


9780 


23681 


A 


9853 


108 


420' 


GPRVCVRLSLRDDSDLLATEDSLAHTEG 
DPVTAS SHS QAP S LS VHPGAALEGVCWG 
TGAIAGTPEPQKPPQLTPGGRPA\PIAP 
GLTWEPPSHPPPQ*RSPPPPP 


9781 


23682 


A 


9854 


2 


256 


FFIFPLRYYIYCARFQFLSPILYLGK*K 
MD* * RRVFQE KWE Q AYFF VE V\ S PMCL I 
CNQTLS VSKE YN * R CHYETNHGENFD * F 
TE KMHDENLLHLTTRQ P 


9782 


23683 


A 


9855 


323 


3 


I IQMANKHRKRH \ QSQ * * VRVMQKKI I M 
IHRYI PQWLKTSKSDMIKCCSGYGTTE I 
FTHYLiWE CKLLHLLWRMVWL \ ILLKFKT 
DTPHDPAIPLLGTYPAECMHMYPRA 


9783 


23684 


A 


9856 


28 


385 


DGVTQAGTQWGS KFTAA* TA* YQGS I PN 
AWSFFSPPPVLKTPPPPQKK/YPPPKKK 
KIPPPKKK 


9784 


23685 


A 


9857 


436 


518 


AANRLENI YRHL I YDKD / EHYRIVGKDD 
LFNKWCWVNWI/ASNKEKNLDSYLIPHT 
K/LNSR* I IQLDVKD* I I KLLEDNLYLH 
DLGDRQKFLGRI/PVFTIKKKIGKFDPM 
L»KL S T F VHQ KT PC* RLNS A 


9785 


23686 


A 


9858 


153 


492 


RLAGSDPGVADVSVQLQRQEKSPCPSWK 
AVS Q AE S S S S S AGVSL VLFRLIADWMRP 
I P I KE E NKL \ S Q S TDLN FN F I QKHCHRN 
TQNS V* PGGVGPPGTCGPFRRQSSP 


9786 


23687 


A 


9859 


2 


419 


TTGKLQVSHKKSTYSHFSKKPTHTTNKD 
MKRYSPSL/AIREMQSKTTG\MRYHFTP 
KYG\NNKCWLGCGETETLI YGW* ECKMV 
QPLWKAVWHFLK* IdSTIES / PIY* *NYS * 
YISYSWRKTCITIYDPVILLLGIYPREV 
KTH 


9787 


23688 


A 


9860 


88 


419 


TFFFFFGFWFLKTKPLFVPQLFEGPGP I 
LG*WNPPPRD*KNFS/GPNPPGGGE*RA 
QPPPPGYFLFFKKK\GVSPWGGGGSKPP 
T PGNP P PGP P KI R VI RGG P P P P GGRKHF 


9788 


23689 


A 


9861 


301 


401 

> 


KRA*GGQEPIKVCPFFYNRKVGNGSPPK 
KFKKGRRAAFFGLRANKRGLFVKKGKKI 
WVGKVGEP IT/DP FKEFGGR\LPEKDGP 
LV . 


9789 


23690 


A 


9862 


220 


415 


KIMGGAQ I FRGGGGFFFFLEGWEKNNFG 
VSFRRKFFFGGGVFLPPPP*HKKNYFSS 
QRQYISLGGGGRKTPPPKKNFLLKDTPK 
LFFSHPSKKKKNPPPPRKIWAPPMIF* I 
PPPPIFFFFFFFFFFFFFFFFFKN*P/S 
FFFNFKKPIFKTFLSPPFKVFPPPKKKK 
KKNP P I S YRRWPLAI 


9790 


23691 


A 


9863 


116 


366 


GQEFETSPAMFCFETESCSIA*AWQWH 
DLSSLQPLPPG\SGDSPASASRAAGELL 
EPRIRRLQSVEITPLHSSLGNRVRLHL 


9791 


23692 


A 


9864 


619 


295 


FFFEMEFSLLLPRLECNGAISAHRNLRL 
PGSSDSPASAGGLL*SQVAGITRLRHHD 
*LiILY/FLVEMRFHRVG*AGLELiLTSGD 
PPSSASQGAGITGMSHSAGHYGKIF 


9792 


23693 


A 


9865 


12 


432 


IADRRLFFTNHKDIGTLYLLFGA*AGVL 
S TALS LL I RAELGQPGNLLGNDHI YNG I 
GTAHAFG I ILVI VI P 1 1 IGGFGN* LVTL 
IIGAPDMAFPRINNISFGLLPTSLLIiLL 
TSAILEARAGTC* TVYPCLA\GGYSHLG 
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Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine ? I=Isoleucine, 
K=Lysine 7 L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *— Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














A 


9793 


23694 


A 


9866 


30 


380 


LFSTNRRGIGPLYLLFGA* SGVLGPALT 
LLI*AELGQPGYIjLGDDHTYNDIDTAHA 
FVIILFIVIPIIIGGFGD*LVPLIIGAP 
DMAFPRIN/NISF*LLPASLLLLLASGI 
REGRSK 


9794 


23695 


A 


9867 


256 


395 


E VLRKS EKS FET VC I INNKC / WPGMVAH 
ACNPMTLGGRGGR IT* GQE F 


9795 


23696 


A 


9868 


458 


2 


PKKRFFSQKPPRGFYSAPLKGKNYIFPP 
PVNFGPPKDFFKGPPLFFFFFFFFFFFP 
RFFFFFLEKG*KF*NFFFT*NMVFFYIS 
/CSKKFPFQFVTiTPIPF*KVPKPWFFW 
DFPI FFKKI FFFFFFCDRVSLCHPGWSA 
VAR S RLiTAS PRV 


9796 


23697 


A 


9869 


144 


425 


IPLCSRIYSFGPC*FSLISSSSCLSPFL 
S I PLCVLS * * T * * YLFF YI S MRE IYKTR 
TKGIYPGAQRNSLYTH\FSIEVQSTIKA 
ERFWPGAVARACNPSTLGGRGGQITWGQ 
EFE 


9797 


23698 


A 


9870 


138 


401 


D LRLKLrNL S KN 1 1 RLQGT VE S S VLT I KE 
MQ I KAKVR YNL I P VKI KR / SGNDRWW * G 
CGERGTLIRC* *ECKLVQPLWKTVWSFL 

KKKKK 


9798 


23699 


A 


9871 


179 


441 


PSGKGGIKGPPPPPKKIFGNFFFFLKKK 
GVPPGPPGGPKPRPLETPPPNPPKGGEY 
GGGP\WPPPPIP*FFKGPFKKKGPPWTN 
RGNP 


9799 


23700 


A 


9872 


57 


435 


FTQMRKNLKNNSGNMKK*GS ITPLKDHS 
NFPAVDPNQNEMFEIPGKEFKK/SDY*V 
TQGDTRERCKPT*VLKTILDMDEKCSKE 
MD I LMKNQS E LLE KKDTFRELQNAME S F 
NNRLDQVEERITELE 


9800 


23701 


A 


9873 


228 


443 


FS S KF I LI S VFLFL FLE TGS CS VAQAGV 
QL*DYSSLQPQSPG\SGNPPISASQIAT 
TAGACLHALL I FVFFV 


9801 


23702 


A 


9874 


349 


2 


KKPKKKNFLPPKNFGFFSPFSP*KFFFF 
LKGFNFFRGFFPNFPPPKKKFFSKNSQL 
VFFLPPLKKKI F/CFPTPVKFGP/SQRF 
FLKGPPLFFFFFFFFLDRVWLCYPGWSA 
VARSRIS 


9802 


23703 


A 


9875 


405 


2 


KKAS RMENVR KKKKLEM I KL S E EGMLKA 
NIGQKLSLLAKQQWNAKENLLKEIRSA 
IPLNTQMIRMQNSLIADMENIL/WWI/ 
EDPTNYNIPLSQSLIQSNALT\FNSMKT 
ERGEEAA*EKFDAISG*FMTFKERS 


9803 


23704 


A 


9876 


355 


394 


THPYYSHQE YQS P * P/LTGALS ALLKTS 
GLAM*GHFHSITIiLILGLLTNTLTIYQ* 
WRDVTR* SAYQGHHTPPVQKGIiRYGI IL 
FITSEVFFFAGLF*AFYHSSLAPTPQLG 
GHWPPTGITPLNPLEDPLLNTSVLLASG 
VSIT 


9804 


23705 


A 


9877 


47 


85 


TIYIHICVRIYACECVCVCAYIHICGVP 
AHLTEQNL\KPARTQSP PAPQKNPELPV 
DLVS I SLDREAQDVELNHYRLGNTCRTD 
VLPKVNTLCLRL/NFFPCLPILEELQEL 
P*LENIPLFPQPIDYIYTYMRAYICV 


9805 


23706 


A 


9878 


2 


389 


GRGQFPPEFFYFFWGGSLVPSTPNKKKP 
LGEVLGGGWSNRAP KGWGGGFP P P ILAF 
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KPKGPFGPKFVLKFSWPPLLFFPKMRG 














PLPKGGGFLPFFFFFETKSRSVTQPGVQ 
WCNFGSL*PPP\TGSSNS 


9806 


23707 


A 


9879 


351 


1 


FLGPSKNFNPFPPQKWPFAFFPKSHRAR 
VNEIFS/RLSVINF*GLTKPQRLFFSLL 
NFSS\PIPPFQFPSRLIKEIRFSTFLTP 
ENSKI KKPLFFF * FFFF*DRVTLCCPGW 
SAWQSR 


9807 


23708 


A 


9880 


123 


387 


SFLWKLRPSGAPTCMRCLLA\LLGGVSQ 
LGYTGVRDPLEEAVCPFSELQRHAGKTT 
ALFRAVRQGCFFFAYF*IELFPCY/CSG 
RVGG 


9808 


23709 


A 


9881 


341 


659 


S FLGRVQ WLHACKS QHFGRLRQ VDHDRS 
GV^DLPGQHDETLSLLKMPK/QKLARHG 
GMCL* SQLDRRLRQENHLNLGGRGCNEL 
RSQHCTPAWATEQDSISKTKTKK 


9809 


23710 


A 


9882 


216 


1 


PKFFWVLRDSQIFKPGFGPQGFF/GGFG 
NPQGKKKPGFWGLG* KKKGFFFFFFFFF 
LRDG I S FCCPGWS I QW 


9810 


23711 


A 


9883 


2 


244 


GRVGAVGRREGENFIKCDELPPPPPEGK 
QQNLCKFLTGFSLP CQDKKP \ CFFV* KK 
KKKKKKKKKKKKKKKKKKEKGGGGA 


9811 


23712 


A 


9884 


310 


2 


IFLQTLFPNTFENSLSEYCIIFSEYSFI 
**KQ\TNFILSSIKLYYI*PSGSYISNL 
*Y*ELFSQNLSYQHFLWI*KKWGLGAVA 
HTYNP S TLGGQGRW I AWSPG 


9812 


23713 


A 


9885 


242 


358 


KTVWLYMNI *MANNHME*CSLPLVI REM 
* IKTPVTVH\TVGMAAI *NKK*NITR/C 
W*ACRETAVLVYCWGEFYTQPL*KTVWL 
YMNIE / 1 PCDPAI PLLGMYLKE *RPGTV 
AHTCNPR 


9813 


23714 


A 


9886 


360 


2 


LLKLGFPFWAKKGKKKRLWVEGPLGKPP 
GGFCQKLFFSPGGGPKGFSPSIIILPKG 
* PFWGPFREVF* KKRFKLKKKFRPTFS 
/Y/LFFFFETESHSVA\RMECSGAILAH 
CMLCRLGSSD 


9814 


23715 


A 


9887 


236 


347 


IKGFFFF*TESHSVT\RLECTST/SAHC 
NLCLPGSSNSP 


9815 


23716 


A 


9888 


310 


3 


NFFFFFETGSCSVTQARVQWSDQ/GSLQ 
PRSPGL\SDPPTSAS*VAGNLKFLSGLC 
S S L * S LE I S *DTKKSQVQI TDDLC YT VV 
LCL F FEME S HS WQAGVQWH 


9816 


23717 


A 


9889 


119 


391 


APAYWNPHARDSPFFFF*TNFPFFPQVG 
GQWPNSGSPQAPPPG/SPPP/YSASASK 
ISGAPGAPPPPGPNFFFSFFSSPTFFSR 
DGVSPYEP 


9817 


23718 


A 


,9890 


195 


3 


DGVSLIiLPRLECNGTILTHCCLSPV/SW 
DYRRLP*RL/V* FFLFLVEMGFHHVAQA 
GLELLTSGDHP 


9818 


23719 


A 


9891 


3 


378 


RDGERNMLALGTASAKALKYSLALSLPR 
LECSGTIISHSSLNPP/GLKWSSCIiSST 
S S WD FRPAP PHLAKKTKQQ / HAT F F * EG 
ESPYVAQAGFKLLASSSPPAFSLPKCWD 
YKL\DRHAW 


9819 


23720 


A 


9892 


231 


1 


PKPFFPKFFSP*N*KPFFQISPAKIQKI 
RVYQS \HKKFFSFFFFFHGVSLLLPRIjE 
CNGTISAQCNLCLSGSSDSP 


9820 


23721 


A 


9893 


2 


347 


APARQENWCVYVCVCMCVCIWLHRHC 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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nucleotide deletion, \=possib!e 
nucleotide insertion 














E VLCF YKMLGKKRS I YAY* L VY* Y I Y I H 
MCVYI YI / YI YV/YMYIYTYVCI YTH 


9821 


23722 


A 


9894 


28 


382 


TAHLEAGATGYRTFLLPAIGLLCLPWVL 
DDGAGQTAQ * SMLLDHAMLLAHRAHEL F 
i/DTYYEFEETYIPKDQKYSFLHDSQTS 
FCFSDSIATPSNMEETQQKSNLELLRIS 
LLLIESW 


9822 


23723 


A 


9895 


93 


386 


FWKIISHYYFKYFSCSILISPSDLPITC 
ICMFHLLKLCQSSEFLHLLLFVWLVFET 
QSHSIKQPECS**LSAA/CHI.PGSSDPP 


9823 


23724 


A 


9896 


174 


1 


TLDWGPRKKNPFFFFFETRSRSVTEAGV 
Q*HNHGPLQPQPAG\SSDSPTSASQADA 
W 


9824 


23725 


A 


9897 


251 


386 


LQRLECSGTI/SAHCNLCLLGSSNPLAS 
AS * IAGTTGTLTGDVDST 


9825 


23726 


A 


9898 


127 


393 


GHMGTLTGS PATLS I IiALFFWGGR I STQ 

ldginksXfslflyflrhcslslrle^s 

GAIITYCRLELLGSGDTPASASQETGTA 
GTCHT 


9826 


23727 


A 


9899 


3 


334 


KLRLG Q LNTS AMAAKC * S ERKS / R / THL 
TLNQKLEMI KLREEGMLKAVIGQKIiGLL 
YQ/TSQFVNAKEELLKDIKSATPVTA*M 
IGKQNSLNCDLEKSCWLIQDQTG/HNI 
FLS 


9827 


23728 


A 


9900 


356 


1 


GRKPFSHLPKATLLPMGPVLGGRRALMG 
PDSRPGPVPSCSLVLLTPLAPLPLTARE 
SLCPCPPS * TPQPS VNP/H/GKLARRS P 
CWSGRQSLP*AEIVPLHAPPALGDRDE 
TPSQKKKK 


9828 


23729 


A 


9901 


118 


343 


I P YAKE KKKQENVLR F I H VNL C I S NW F W 
FETGSHFVTQG*VHWCNLSSLQP/PNLQ 
GS GD PPT S AS RAAGATGVQH 


9829 


23730 


A 


9902 


321 


50 


SLRTRV*RPPSAPG*NPFCLKPQKIFQG 
GG\ESPLSQILKRVKQENSYNLGGKGFN 
*PKLPPCPLTWATKKTSPPKKKKKKNYR 
TWETSVS 


9830 


23731 


A 


9903 


317 


8 


NCYDPNSGIVIPLLGILPKSTK\T*VRT 
KTCT * ML 1 VALC 1 1 TKKWK* S KCLSTD K 
Q VNK I * Y I H I ME Y YS P I KGR K Y * HTLQH 
/WMNLENIS * KRLD I JCKPH J I 


9831 


23732 


A 


9904 


287 


3 


RGALNGRGGPRGVKS PGFAPFKLQLKPP 
GTFLNPK\QFPFFPNPPF*KKGAFFHFF 
LPLF*SFRWTEFYSVT\KLECGGAISAH 
CNLCLSCSSDSRA 


9832 


23733 


A 


9905 


2 


406 


PRVRTSSRSRATALFFFFFFFVFFWGKG 
EIWAPPLKNC/ IPPEKFCYFGGGAPGAG 
LPPPPGVIFFSLCRWKKKKKKGAPPGG 
VLGKKKKTPFTPKGGTFLTLGGF* KKSL 
FGKKTLLWVGGPLLIj KNFF * E KFR 


9833 


23734 


A 


9906 


1 


296 


IWVGATECIFKNNPTIWGHQETHFTCKD 
TYRLKV/KGWKKIFHTFHTNGNQK*AGI 
AIVISDKTDFKSKTIKR*KGHY/IMTNG 
S I QQGD I LDW YKSNCS F 


9834 


23735 


A 


9907 


474 


41 


FMEYLTLSFIHVLLQ*FIHCFVHLFIHS 
FFYALMKSLIQ*FIHLYSCF/ICIDLLI 
SSCIYSFIYALVKSLANSLTHSFIHTFL 
Q*FVHLFTPLLICSFTHSSIH*PSHSLT 
FIQLHLFFHALVSGFIWSFIYLFCKINV 
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FLIDK 


9835 


23736 


A 


9908 


286 


84 


LKPPPPFFFFFQANFLCFIRD/GFHHVG 
QDGLPLLTS * SASHGLPKCWDS RCEALR 
PASPLISGPHQTI 


9836 


23737 


A 


9909 


316 


63 


DQLLVRSIGFEALMSPIFFFQMEF/SLS 
LPKLE CNGAI SAHRNFCLLGS S DPPASA 
AV* /PiXIANLLKLCASVSYCIELVLNELH 
WIK 


9837 


23738 


A 


9910 


370 


461 


F*F*FLFSETESHSVA\QLECSAMISGR 
CNLRLLGSGDSPATREAGAQESLEPRRQ 
SEPGLCHCTAAW 


9838 


23739 


A 


9911 


294 


28 


PGKNSTVQKEPGKWFCEMKFTPGENAVN 
IV*PTTKDLEY/WINI/DKTVAGCERTD 
SNFERSS TMGKML \ SNS IAWYTE I FRGR 
KSQLMGI 


9839 


23740 


A 


9912 


307 


463 


CKL*TLSNNDVIVGSLIVT/TCTTLVGV 
LIMGD / RLCMCGEKE YLGNLGTFLS IL 


9840 


23741 


A 


9913 


377 


28 


REMLTVKPLAP S F FRWSRAYRAI FF I T P 
TWVSKSGKVTGPLFFLHIFFPGP/CPGL 
PIFFFIKTKGTRFFFGWFFFFERVSLCC 
PG * ME CNGAI S AHH KLRL PG S R YS PAS A 
SEFHR 


9841 


23742 


A 


9914 


384 


725 


HLTAI RMTVI KDTENSKCERGC \ KTLMH 
S CWE YERVQPLWKTRW* F / LQQVE /LP S 
P CD PS I P / LLRMYPK* LNTD / CKRMC / C 
I PTS I AALFPVTK/ SWKQ 


9842 


23743 


A 


9915 


187 


3 


QPHKHLGLDNIYIFFEIGFCSVAQTGVQ 
*RDHGSLQPRPP\GSRDPPTSGSRVAGR 
PRQENG 


9843 


23744 


A 


9916 


15 


167 


DGVSLSLPRLECNGAILAHCNLR/LPGL 
K*FSCLRLPSSWDYRLPPPHPAHV 


9844 


23745 


A 


9917 


345 


477 


YCSRT S L PL / TLKKGS A* PGAVAHAYNP 
S ILGGRGRWI T * GQE FK 


9845 


23746 


A 


9918 


460 


42 


HQR P KVD KTTKMGE KRSR KTGNS KNQ S A 
S/APPKERSSSPAMEQSWTENDFDELRQ 
EGFR* SNYSELKEEVRTHGKEVKNLEKK 
LDEWLTRl/SNAEKSLRDLMELTPKA*Q 
L P P E RTNL S S * FNQLE ERVS VME DE INK 
MK 


9846 


23747 


A 


9919 


282 


21 


AAGYLQPS EFWLHS LGSRRGL P I S VFP V 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
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RGRQAWLSCDGYHPVRPSRLLCLPTQA 


9851 


23752 


A 


9924 


254 


3 


PGGFYFPGFLPPPLGFPKGFL*RLWPPC 
PATFFFLYQISKFPFFFFGKYQKNFFFF 
FFFFPES KSRS VT \RLECMGAI SAHYNL 


9852 


23753 


A 


9925 


309 


1 


YTPTIMLLGIYSNELKTNVHMKT*I*MF 
TATLFLIAETWKQPRCSSLSEWI/QQLW 
YIQTI/EYYSWRRNIK/LMKTW/MKLK 
YILQSEKTQSEKTTYI *FQLYDI 


9853 


23754 


A 


9926 


237 


2 


RWNFNFTQLILKCIFLRTKFFS*VSK*N 
FKYFLGFWFFFFLRQGLTPVT\RLECSG 
TISAQCNLRLPGSTDFSASASQE 


9854 


23755 


A 


9927 


244 


1 


KP Q CLWRK * D * WHNTQ KLD VE KGKP L I H 
YWWERKLV*PLWKTVWRLMR/NLKIELP 
YDPVI PLWGNYSKEMK* LCQREDSRA 


9855 


23756 


A 


9928 


2 


644 


IVQVDQKIRLNIHCLQETHFKCKYTYRL 
K VNR * R * VYHVN I NQE KALVA I DGAYFR 
ARKITRDKERHY/ XMIMGSMLQKD 


9856 


23757 


A 


9929 


1 


364 


GTSGTRPINNTTVFSKCSSERKT/R/TH 
LTLNQKLEMIKLSGGGISKTDMG*NLGL 
LHQ TVS QWNTKE M FL KE I KSATPLNT * 
M I RHQNS L I AAI GKVL WW I DQ /TQHNT 

LLSQRIilWNKALT 


9857 


23758 


A 


9930 


88 


374 


ILKACGPCPGNPHPWGPL*GQGGLGLNP 
SKSPPHPHPRGDPTWGGTPIGDYGPKPP 
KLPHGAGLPKKPCSSGGQNN/ PKISGP 


9858 


23759 


A 


9931 


384 


2 


LCSPRFFYRKIWGGGQGSRLLPPPFWKP 
GGKDCFSPGVLDPPGEQRGAPCFP*KKI 
RWG*GAAPWPPPQEG* /VGRAPLTPKI 
PAPVSFGCSPALPGEGQSETPFFLKKKK 
KEKSEAMIPPPPGPRA 


9859 


23760 


A 


9932 


32 


359 


IFFFLFFF/ C*KGGPPPVPQAGGQGPKT 
PPWPPIWAKKTKPLSKKKKKKKKGAGPP 
PPPKGGPPPLKKGEGPLLKRKFLKKNGG 
ENFKKKKI WAGGGGNPRE P P P FGGAR 


9860 


23761 


A 


9933 


138 


1 


TGFFCCC* SFALVAKGGVQWCDLGSLQP 
PPPG\SSDSPTYASCSC 


9861 


23762 


A 


9934 


103 


355 


LAGQACCS * STSANRCVNHA \ PGYANGR . 
CDTITCI LVFIRNS WLI TP I AL ITMFCY 
HVG * VRPQEQAPGNRVS IiS DLXjL\P P VT 
C 


9862 


23763 


A 


9935 


29 


363 


AWSHNAFLLFFFKRELAFVPQRGERENN 
FVSLKPPPPG*SPFSAPNPPKGGNKGPP 
PPPQLIFFYF*/GEKGFSPYNPEGPKPP 
TLGKPGLF*NTKKICPQPRGLKKGGQHI 
W 


9863 


23764 


A 


9936 


173 


363 


P KKRP 1 1 S LKRAFNSNPGD * QKFLKRGL 
PPMGSQFKRGNYPYDKNLEKLCPGGGNK 
GSG/YHYGL 


9864 


23765 


A 


9937 


360 


3 


NQRMRKNQSSKTGICKNQSACPPPKERS 
SSLAMEQSWMENVFDELREEGSR* *NYS 
ELKEEVRTHGKEVKNLEK/NLDEWLSRI 
S/N*EKSLKDLMELKPKAQELHGECPSL 
SSQFNQLARA 


9865 


23766 


A 


9938 


194 


370 


VS NF F F F FE TE F C S FAQVGAQGQNFR * L 
KAPPP\NCCLFPT*ASRWGTTGTCHHS 
WLI 


9866 


23767 


A 


9939 


426 


1 


SQEFKVAVSYDLTMHSNLSNSETLSQKE 
KTKQTKTKQKKTY*EGRKHIKRCPTPLI 
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nucleotide deletion, Y=possible 
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IREIQIQTTVRYHFTHIRI/VLLPQKQN 
KCWCRVEKLKPLVRC*WECETVQPLWIT 
VWW/FLKILNIKLPFDLATPWCirQME 
FSC 


9867 


23768 


A 


9940 


255 


16 


RVIYNIYIMYILCIYTPSRLLVEIGFHH 
VGQDGLDLLTS *SX RLSLPK\ CWDYRYE 
PPCVNIYIYLHIYLIYMSDGVKY 


9868 


23769 


A 


9941 


206 


355 


FLQGHTAIKNGRG/WPGAVAHTCNPSTL 
GG*GGRI I * GQE F KT S LTNMVE P 


9869 


23770 


A 


9942 


3 


402 


HEELNPRSATSS PALVLHVTIAHVTIAF 
AEGCFQHDFCFSQPENRHIPLVFTTASK 
VRN I F YL L CRGRG I LRFFL VCL K I D PA I 
PLiMGIYPK\KSLYHKDTYIHMFIAAEFT 
VAKIRYQSKCPSTDD*IKKICV 


9870 


23771 


A 


9943 


79 


328 


C I KILLKFLCLFC I LPPCLNFFLRQSCS 
VARAGVQW\SLQP*PPRLKHTPTSAS*S 
AE TRGTHHHAHL I FS YNFL * R * VS HVA 


9871 


23772 


A 


9944 


321 


414 


VPSWAG\PVAHACNPSTLGGRGGRIT*G 
QEFE 


9872 


23773 


A 


9945 


142 


480 


QEAKS FDFS DS TTME VMLDKKQ I RVI FL 
FEFK\MGRKAAETTCNIENAFAPGAAND 
*QMRWWLEKFCKGEESLEDEECSGRPSE 
VDDDQLRAI IEADPLTTIQEVAKELS ID 
H 


9873 


23774 


A 


9946 


191 


3 


FFLPRGGGDSVFKKPPKKKNGFPKIFPF " 

CPGFPLAT/lNFPPQIFFFFFFF*DGVS 

LCHPGWNAV 


9874 


23775 


A 


9947 


451 


461 


K* INK* KVKGW/RKEHHANINQKKVDVT 
IFISDNMNFRKKNKITGDREGRYIIIKQ 
P IHQENI AILNVYVPNNRVAKYVK* KL I 
E L KKE I HKF T I LVGD FNT ILS T I DRTTR 
*KVS 


9875 


23776 


A 


9948 


186 


405 


NHTLAI S L F * PYKP *AILKLG I YRKERK 
LVC*RDI CALMFITELVTAAKLWNQPKC 
PSTDK/WKMWYICTMEYYS 


9876 


23777 


A 


9949 


18 


412 


PEFRITTILPEKKNLL*LTN*VSKFSGY 
NINI * TS VAFF\ YSNNKI SETGI KI VPF 
T I LSNRMKYLGM/ TLTKNMKDLHTENYK 
MLRKENEEYQMGNMGSWILKINIVKISI 
LHKVIYRVKFLSNFNAILLKK 


9877 


23778 


A 


9950 


1 


396 


LAFDGRAERLKTGFHRVTQDGLELLTS * 
S VRLGLLKCWDYWRE P / P VPGLWD 


9878 


23779 


A 


9951 


307 


405 


GIRVRR* PQGLRPPNPDADVFKKMVWPP 
P TN\ PRGWGGKGWE p pqgpqdp p pq aap 
GFLPETWRGPWNSLPTRGAWPPNSNLPP 
WPPAPSRHPAQRAPGPSTAAPPPGGGGE 
GWDPPGRV 


9879 


23780 


A 


9952 


2 


402 


YGRPKEGGSLRFWLTTPGPKGETPFFL 
KTQKITGGGGGGP* FPLPRRVRQENSFY 
SRGGGFN* PKRAPCALTWGKKQKFFLKK 
KKNTPQKPKKLPGCRIQY*TQK/ SSSLF 
MGGTPGKRQ 


9880 


23781 


A 


9953 


249 


1070 


S S FISRFYFI CI YLFETRSHS VTKARVQ 
CCDYN* LQPQPPGAQE ILPLQPP * SAG\ 
ITGVAPNTPQLIFYFFVE/ SGGSTHVAP 
RLALNSWAQNIAL/RLPKVLYFTF 


9881 


23782 


A 


9954 


75 


396 


GFKGRKRGLPLPLNQKLKTLNLSEZGLS 
KANTGQKLGLL / RPVS Q WNPNGKFLRE 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso)eucine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














IKS AF / P I EK* VI * KQ1STS L I PDMEEVLG 
V/WKTKKYSHNIPLGLSLTQKRVLTLF 


9882 


23783 


A 


9955 


181 


458 


RLWNDGLYNI FDLKLQ/ S IHV* SLLLLF 
ILEPESHSVS\RLECSGAVTTCCSLNIP 
GWSDS P P QAFQ VAGT TG VC PNAW VP S AF 
L 


9883 


23784 


A 


9956 


67 


430 


LAPRLIYSPIKFQWSFMSVRQIQVQSIiA 
I GNFHNPRLP I L I CHCYAFHR I LLFQMV 
RNF I L YS I S I S ANKQES KC I LKYI LLY I 
NLRNWLGT VAHAYN / PSTLGV* GGR IS* 
AHEFETSQGN 


9884 


23785 


A 


9957 


77 


422 


LPLALWAS CRPQLCLDTDF IHGPPTSGA 
ASQFLCFQTMQPPTCSSSRLYNL*PALV 
LiDHTTSNWCFRSP/SFFFLLPLPLPPA 
PLP\ FFFFFFFDRVLLCHPGWSAVARFQ 
LiTAAS 


9885 


23786 


A 


9958 


3 


422 


LHSS L VTEQDS I S KKKKKKKI FLGGTKK 
GGPSLGGFFFGPPGFPKIVKKKRGALIG 
FPFFCPKKKPKKPLFFPKPLGKPPRKKG 
F*IGEPWKT/EGPGFPKKLNFVFFSPWG 
GNFLFPPQKPGGKVGKILLQPKKKPPGK 
G 


9886 


23787 


A 


9959 


370 


2 


PFSSWVANFREGDGEKCSVLQGIRFCMF 
CRYINVTPLVHFPFPISKGTSFFPLYFF 
SPEVM/HFP*LPY*FGWALLSRF*/MSL 
L*LFFSKFPFFPSFFFF*RQFHSSPRLK 
CNGAISAHCNLCLP 


9887 


23788 


A 


9960 


256 


2 


AWEEETAPLHSHCLGNQARPWNHRHAP \ 
P * L I VF VFS VETGFHHVAQAVLELLAS S 
DPPTRFNLLSSWDYRGLSPQPMSRKSAQ 
EG 


9888 


23789 


A 


9961 


269 


3 


HRQACGQVRSPIQCWWACKIAQPSLQIV 
RSSLKG*I*SPYDPVIPLLVIYPRELKT 
\SLHTKTCT*RFTAALFIIARRWKQLKC 
PSIDE 


9889 


23790 


A 


9962 


268 


324 


SGTLSH/ W I FS* VG IE ITMWLKVPDQTK 
KI SLRS AEAAIKYFLTQATAS 1 1 IRRAI 
LFNNRLSEQSS I TNTTNQ YS SLIIIMAI 
AI KVGMAPFHF* VPEVTQGS PVTSGLLV 
L 


9890 


23791 


A 


9963 


341 


2 


ITFLPTKI IRQSNFKTGKRKTDSMDLNR 
HFKREDTQMANKYM * F I * KMQ I \ KTTMR 
YYFISKRTTISRKTSNIQC*DTWSPGTF 
INCLLECKMVRLLWKMTWQFLTEVHIHL 
VY 


9891 


23792 


A 


9964 


250 


2 


YHPVNLYFKYISPIDV*FGCFIYIFDYP 
LCSCFSLHPLVYLQ*L/LFFFFEAESHC 
VARLE C S G WS AHCNLRLPGS S D S PAS 


9892 


23793 


A 


9965 


3 


334 


RTLRHHI WL IDPS FE *HLS CFHLLAIVN 
S AAI N I P C / HS Y V * THF * YS * M YRGRTA 
GSYAT/SMLNLLRNCSTLLHRNCTF*PF 
YH/ WE * VP I S PQS CQKLDF FIFLIIAIS 
EG 


9893 


23794 


A 


9966 


179 


352 


NQE*TEILNRPMSNQIESVIKYLPTRKS 
HRPHRFTAELYQTY\ ILLKLFQKVEGEG 
LL 


9894 


23795 


A 


9967 


2 


350 


THPSINSFIHVFIHASMNSLIHLPTHSF 
SIYLFTKSFIHLFIHFFSFLFFLRPSFT 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I— Iso leucine, 
KHLysine, L=Leucine, MKMethionine, 
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PVAIYSFIHPFTYSSIH*FIYPCVYSCI 
DEFTHSFTHAFIQHLFIH* II/IFIYSF 
ISFLFF 


9895 


23796 


A 


9968 


228 


558 


CHQLRQEIAIFTSFVILQLFSGHLDVYM 
QAWAQRPDKYEYDNK* FIEI KKI IQFTL 
ISKRMK/YVGINLTR*VKDLHNENYKTIi 
MKEIESDTSEWKDISCSWNGRINSVKT 


9896 


23797 


A 


9969 


196 


2 


DRKVFFRPLFIIFGLKKKIFPLFWFFPS 
/IRGRG*IFFFFFFFFETESPSPRLECS 
ETISCHCTP 


9897 


23798 


A 


9970 


245 


1 


TQCDMCVFIYTHIYISHICYMVYIYIHT 
RTHTGILFTPKREKNSNTCTCATKDNP / 
CMTLC* / HKRRQTHTHTHTHTHTHTHT 


9898 


23799 


A 


9971 


163 


380 


NHRRQKKNGR/ HNGNKYKTVTKMINSNP 
P I TRNNLNVNGLKTP I KRQKLSDS I KKQ 
YPSIYCL*KT/YLKYKD 


9899 


23800 


A 


9972 


328 


1 


AKCGFLiKKLNIELFNS IPGHS CKT* TQM 
FAAALYIISKKYKQPSCPSNDE*MNEIW 
H I LT I GYSAVKKI S WG \ RWMNLENI ARL 
SGRQRGHILQDSVDMKSPE*AIPQ 


9900 


23801 


A 


9973 


200 


3 


MASKCSSERKSHR/SITHNQKLEMIKLS 
EEGMSKAKIG*KLGFLCQ\VIHWTAKG 
KFLKEIKSATP 


9901 


23802 


A 


9974 


111 


517 


NLEQPLIKRTIANKPIKKINTTILI ITI 
NVMGLNTLTK/ RQRWSD* IFKNQDPTMI 
/C*KKN/HCKYKDTNTLKIKG*KNIYDA 
NMLTQ I KKKAGVAI L I / DKIDFRA/ TD I 
TRAKEGHF I MI KGSVHQEYVT I LNVSAH 
AS 


9902 


23803 


A 


9975 


342 


3 


FIATKKIFFFFPVFPPQGPPPVLVAPFF 
WAFGGGPPKWAPKKKFFGPRGPPFENFF 
F*GFFFFKTGGFFFBRFSKKVFFFFFFF 
SETESCSVA\RLECSGTISAYCNLCLPG 
SR 


9903 


23804 


A 


9976 


13 


326 


ILDHSNRPSSSTLKTPNAGKDVEQKQFL 
/LCCW*ECKIVNYIjGR*L/WQFIjTKIjNL 
L*PYDPEILLLGIYQKELKTCIHTVACT 
* MF I EALFV I AKT * KQPRYS SVG 


9904 


23805 


A 


9977 


74 


396 


LAPRTQPLPCSPAPLRQCHTQSIiLPPFL 
IjRL IFYFFCFL KTRAHS VAQAGAQ * RDH 
GS L QT * T PTLACT P ASA / VART TGTHHH 
AWL I VF FIi /IE KVSHShVCDKT 


9905 


23806 


A 


9978 


287 


2 


QPHNSSYTQIFTPKIIIKTPPPNLKQNN 
TTPIKKKTQ*FP/PSTNP/SYPSTHPPT 
HLFIYPLIHISPPPSPTHPRIHPPTYLS 
IHPSTHPSIHPPSR 


9906 


23807 


A 


9979 


97 


332 


GNNDLFLYFFFFLETGSHSVAQARVQWC 
DHSSL*PQPPGSSGPPISTC/RLGLQAC 
MCHHAWLIFNFFCKGRLSLFGLIiR 


9907 


23808 


A 


9980 


168 


2 


STWLRWDYWKTFFKTSKS /WPGTVALTY 
NP YTLGGQGRH IT* GQE FKTSLANTVKS 


9908 


23809 


A 


9981 


267 


1 


LKRERR*RRAN/ SKTSRRKE I I KIRAE I 
TATENRKP IEK/ SNKTNG* FFEKTDK/ 1 
DNPLERLRKKERD* ITKVRNGREDIL*N 
NDLIKVKKD 


9909 


23810 


A 


9982 


230 


412 


CHIYLC/D*PIDGIV*CHIYIYFGHKHT 
HTYTHTHTHTNTHPTC*VSKILLKKKRR 
TCSVCA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R— Arginine, S=Serine y 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, - : =Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9910 


23811 


A 


9983 


229 


415 


SYVFCHIRFCPLDSLNKEIVA/SQVQWC 
VS WS TT * VAE AGGSLE PRS LRPDWAS Q 
RDPISKN" 


9911 


23812 


A 


9984 


2 


288 


RG YD S T S S AYT VGH FT * E D KAT I TRL S G 
KENVKHAGGETLRRLLDVYP * TQRIIDS 
VGNLNS A\ S AMM \ GP PRL P KVLGLQ AWT 
TMPGP*KYIHIFF 


9912 


23813 


A 


9985 


246 


3 


FLLDYQYCALCCHCFLFNFANCGHFNI L 
LGGVKHTKFPKYS * KFSVL/WLDAVAHS 
CNS S TL S GLGW I TRGQE F E T S L ANMVK 


9913 


23814 


A 


9986 


342 


1 


FLNFGLTGFFPKGPPKPLGLRDGALPLD 
PPQFLGVFKRGAFFLGTQ* FHF\LEGFK 
FWFWEPKAFFFFF/ CCETISAHCNLCLP 
GSSDSAASASWIAGITGAHHYARLSRRL 
RQQN 


9914 


23815 


A 


9987 


330 


1 


NRPLNNLVMAS KCS SERKGYTS F VLS RK 
/LRKMIKLSEESMLTVTTG*KLGLLCQT 
/ VQVNTKEKFLME I KS ATPGNKGTVTKS 
NTLTADREKV*VIWIKNKTNHKIPLSQ 


9915 


23816 


A 


9988 


209 


2 


IKGSHFPVLI*YRKIIIIFGGVKGKGGF 
YSY\LFFFFEKGYHSVTQLKCSGMILAP 
CNLCLPGS*DSPTS 


9916 


23817 


A 


9989 


256 


357 


QHRWMNTEMLNYLC* I I YVHF/HTHTHT 
HTHTHTHTHTHTHT 


9917 


23818 


A 


9990 


263 


1 


IQIRVFENSHLWPSYVHFPHGHSYLFCS 
L* * AFFRH\G * F VLS VHFLKI FLNLKKI 
FFETG\SHSVAQACSGMISAHCNLCLLG 
SSDR 


9918 


23819 


A 


9991 


3 


368 


SLDPRRSRLHLAKIVPPRSSLGDRARPC 
LLKKLELFFETGFCCVAQPGVRWCSHGP 
LQPRIPG\SSNPLTSAT*VAG\IAGMCY 
HTQLQLAF 


9919 


23820 


A 


9992 


294 


391 


S GT VAHACNS S TLGGRGGW I T * GQ / E S Q 
TSLTNM 


9920 


23821 


A 


9993 


317 


3 


TDKELLjLiRDEQRKC fveme FS PGKHAMS 
NVDMTKDLEYS INLVDKAAVELERADSN 
FERS S TLGKMLSNS I S CYKE I FLERKS Q 
LM*QIPLLLM/FKKLPQPP*PSA 


9921 


23822 


A 


9994 


193 


395 


IFWITCCSSLSVLAASPCTFMLWRQLLS 
LNLMNQ * DHSLSQS L IQSKS ATLFNSVT , 
AERGDESAEEKLEASRSWFMRFKERSCL 
HNIKVQGEAARTDR/EAAASYPENLR/V 
KDEGGYTK*RIFWDEIAFCWKTMPSKT 
FIAREKS I PGFKVS KDRMTARCGRSCL 


9922 


23823 


A 


9995 


82 


420 


S FLWKL G PRGAAAYVRCVSA \ LLED VSQ 
LGYTGVRDPIEEAVSPFSELKRCRERTT 
AVFTAVRQGHLSLQK\FLLPFVQLCPAH 
RGGV * RQ * ALL C CGGL S P VQAP L P L CL P 
TQ 


9923 


23824 


A 


9996 


376 


1 


CGRTDAPAS TS P YAMTVRFLRP PQ P CVL 
DGLHNC P ARAP QE QNS LGE VD KRG P REQ 
TR\PATAAPPRPLGSPSWICPGDW**CR 
NS LQG F S LAAS VARGRD WALAGL PAT VS 
ARFQEQQKMNTV 


9924 


23825 


A 


9997 


495 


982 


VQKFLRPNLAPKKHQRKLAPNSLQGRLR 
SLPSPTVWCTMAPPTGVLSSLLLLVTIA 
VCLWRMHS WQKNHW*AS FKKTHD * TGLA 
E P\ S CARKQCSEGRT YSNAVI S PNLETT 
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T.v^inf* T .=1 t*iif*! nt* 1VT=1VT p+Tiinriiiii* 

-IV JujoJUIC, JLj MjCUCIHC, ItI — lYAClIillMIIIIG, 

N=Asparagine, P-Proline, 
Q=Glutamine, R=Argmine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y-Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 














RI MRVSHTFPWDCTAACCDLS S CDLAW 














/ WFEGRC YLVS CPHKENCE PKKMG 


9925 


23826 


A 


9998 


401 


341 


IASKCSTERV/SRSHTLKQKLERFKL*E 
KGM/ S KNE I GQK* ALL YQ/ VS QWNAKE 
KFLKE I KVD \ LHAQM I RK * NS LTAD VKK 
VLWKIDQTL/PLGHSVIQSKALTLFSS 
MKAQ*GEEAA*EKS 


9926 


23827 


A 


9999 


30 


393 


DTTILKLRQLIIGAVATKHWSERKSLTS 
LNLNQKLGTI KLTEEGMLKAKKGQKRGL 
S CQTVGQVMN/S KE KFLKE IKHAI PGNT 
*MISK*NSL/IIADMEKVLVG/WSKVIF 
*DQTSHNIP*SQN 


9927 


23828 


A 


10000 


133 


1420 


EQRRLPVTPGVASLQTLCCFLPRRGCSH 
G*ESAATHPPGPELVLLLLQGH*AGYLG 
/ DFG / P RTATG S P G \ P AGVQ PGP CL VP Q 
LSRAAAAG*PDPT* *GAIFSYHCKALRA 
TAARHCPPGC*WPG\PAGSPPGTSRPLS 
HPSS PS PPWTGVSS FLLNLVLCP *ASDA 
QGHACYPGPVPALAGQGHDSAGGRLGDP 
GHHWQQAALHG /DQGPPGHQAHAGTQPS 
AQGLGC * ATHPGEGG* VGYQGLPAG*GR 
SHPRFPIVTRPMGRSCPYATVRPGPGSQ 
GQTPPGLV\PRGEAGQNPGGAAPGLSCI 
QEVTGRWWW*GKKPLGHYPPCGAVRCPS 
S * PHPDTMFKGTGREGCLPPTSPVGVTG 
QMS\EGSRPCEWTLTMSPWGE*SPRASC 
AMLHFCPGSRVGGYHGCPPLHHMGPQST 
AGQAGQPHTLD I KAS 


9928 


23829 


A 


10001 


1818 


6682 


I KFP E AE E E E I Q E VL VLLFR VAE E KENP 
SRVGNS QRE YENQ WLYSI CNQLR YRNN 
L/RHVKKDERGYYEELLNYSRDHLMLYP 
YR LLD I MVKGFS I T P F S Y YTG I MEN I MN 
SGHNFTAADCLRLLGIGRNQYIDLMIQC 
RS S KKF FRRKTAGDLLP I KP VE I A I E AW 
WWQAG Y I TE DD I K I CT W PE KCATD KT V 
DSGPQLSGSLDYNWHSLYKKGFIYLDV 
PMSDDSCIAVAPLEGFVMNRVL/NFDYF 
ETLLYKI FVSVDEHTNVAELANVLE IDL 
SLVKNAVSMYCRLGFAHKKGQV1NLDQL 
HSSWKNVPSINRLKSTLDP*KMLLSWGG 
GESRRPVQEASSATDTDTNSQEDPADTA 
SVRSLSLSAGHTKHIAFLFDSTLTAFLM 
MGNLS PVQSTGEGEAQRYFDHALTLRNT 
I LF LRHNKD L VAQTAQ PDQ PN YG F P LDL 
LRCESLLGLDPATCSRVLNKNYTLLVSM 
APLTNEIRPVSSCTPQHIGPAIPEVSSV 
WFKLYIYHVTGQGPPSLLLSKGTRLRKL 
PD I FQS Y\DRLLITS SG\HDPGWPTSH 
VLTMLNDALTHS AVL I QGHGLHGI GETV 
HVPFPFDETELQEDSC\NMGVHKALQIL 
RNRVDLQHLCGYVTMLNAS SQLANRKLS 
DASDERGKPDLASGSDV\NGSTESFEMV 
IEEATIDSATKQTSGATTEADWVPLSLC 
FG I PLFS S ELNRKVYRKI ATHGL\ * EKR 
AFKNLLHSSRKLSLQVLNFVHSFQEGAS 
TLDIHTEPSFSSLLSQSSFADMGVPLPA 
KKFN/SLKIVSY 


9929 


23830 


A 


10002 


439 | 


1135 | 


MAVDFYNFVTKLWTTGYLRISFLAYKF 
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E=Glutamic Acid, F=Phenylalanine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FSFPFLDSLS/LLSPRLECSGVIPASLQ 
PLLPGFKRFL/CLSLPSS*GYRRAPPPC 
P ALL YF * VE TGFHHVGQAGLELLTS GKL 
TRLGLPKCWDYR 


9930 


23831 


A 


10003 


216 


572 


LTLVS IFTAKQSVTLPI ICMLEMRKWLL 
NDTSAF/PI I * KVLFPKI FFFFFF* KGV 
LFCPPGGREGAQFGLTKPSPSGLKEIFL 
PNPPGGWKQRRVPPFAGYFCFFIKRRVS 
LLWRGLPN 


9931 


23832 


A 


10004 


3 


354 


YSNNP KDRKGE TG INEKTNNKMVDLS S Y 
TSVI TLNF*ML * /D /KIN* KFVKSP * KK 
PTL C CLLE T Y F K YG I GGL KVKRWKK I *H 
ANTIGKKTLVAILITKQTSELS*YPNKE 
EHYILIKE 


9932 


23833 


A 


10005 


187 


455 


ASIHYSLGVRIGIYIPGCIHPGEKSQKC 
NTCGKNSTQKPELVN/HR*LKGHRCMK* 
NEGGKTFTRVQSL / ACHKRVYTERKS YK 
C 


9933 


23834 


A 


10006 


134 


456 


I WNLPLGKPCAWKQVTSDL I S PDAI FIN 
AAIHSLSFFFFFFETKFLFVPQVGGPWH 
NLN*LKLLPPKL\SDFLVA*VSKKVGIT 
GPPHHPGLVI WGFKKKGGS PMLPR 


9934 


23835 


A 


10007 


210 


451 


CYYTIQTTEQPANDSMLGIKPHVSILTL 
NVNNLNTPPKRLGVADWI K/TQDPAFCC 
LQETSLTCNRTHRFKVKG* /WKKIYH 


9935 


23836 


A 


10008 


169 


462 


YPCWVDMAKNTS YNKCWQGCGE IRILPH 
CRWEYKMASPLWK/SQFLNK*NMELP*D 
LPIPLLDIDPKEWKTGVQTKTCR/RMFI 
AALFPTSRS 


9936 


23837 


A 


10009 


490 


724 


EMPP * FME KNFLAPGG \ VT P PGVNTTR V 
NPKTAFDPKRSSSPFLEKSQKPGLIEGL 
THLKTL I LNPLKKDESGE I PVLF 


9937 


23838 


A 


10010 


306 


588 


KQF I RAF I EMS WWLT P VI PVLWDTKADG 
SGIRDQPSQHGETPSLLKIEKLAGHGGA 
GL* /SQLLERLRQENHLNPGGGGCSEPR 
SCYCIAAWVTE 


9938 


23839 


A 


10011 


473 


53 


ISLGQDSTDISTINIPRMAFKLSSKRKS 
HMS FS LNQK/L / EMI RLS / EE AC * KPRS 
Q KLGLL S QS*PSCES KFL KE TKS AT P VN 
TQM I * K* NS L I TNK\ LKE VLMVW I KDQ I 
KHTISLRQSVI * CKALPLFNSMKAERGE 
ESAKC 


9939 


23840 


A 


10012 


184 


455 


FWLSLIKKPLSSVTQKKINRET\ILLTT 
KKKKKD CNFLEGGLL I GGS VCRLVYRH Y 
ATLYSVFCVDSSKSERGSVDPTQVFVET 
*DKCFDY 


9940 


23841 


A 


10013 


203 


453 


KEELYISRERKSSTSLLLNQSQK*LIKL 
SEQAMPNANIGDRLKARPLSPNSHLIKT 
KQKFLKEV/KCATP 


9941 


23842 


A 


10014 


237 


483 


TLIRYIILQGLFSYIPPFFFETESRSVA 
QAGVQ W * D LGS LVP G S RHS P S S AS Q VAG 
TTGFHY/HAWLI FCLFCATGS 


9942 


23843 


A 


10015 


157 


875 


DD P VRGRGE E SAM P S GGRRNRS VS S S WV 
G S MAG I TT I E AVKR K I Q VLQHQADDAE E 
*AEHLQ * E AE GKRWAWE Q AE AE VAS VNG 
R I QL VE E E LDCAQE C LATALQ KL E E AG K 
AADESERDTKVIE I WALKD/E/EKMELQ 
EIQLKEAKHIADEADGKYEEVACKLVI I 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
Q=GJycine, H=Histidine, I^soleucirfe, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V= possible 
nucleotide insertion 














EGDMGCTEERAELAESRC*EMDEQIRLM 
DQNLKCLSAAEEKYSQKEA* CEKERKIL 
TDNLWKSPMR 


9943 


23844 


A 


10016 


579 


30 


CG I S T KKNKWM I KQNS S E L S T C VHNQ I T 
LHKIAMATQWG\KNSFFNK*C/W/EN*V 
ST/ CKKI KL S PSVTCT I * KQYLKWI KYL 
NIKI *QTS*IS\NGGKYDIGLGTNFLLQ 
KTKWLMS FS * L * NQMHKEQ I X EKWDY I K 
LQNFCTSKEKFSGVKMSPKKW/DEIFEN 
HIFDRS*FSQYINNV*NSTKSE 


9944 


23845 


A 


10017 


44 


228 


E F VCR VRGL WW \ CWLAS QLL WEAE AGG 
SLEPKSLRLQ*AMIM\DCTPPW 


9945 


23846 


A 


10018 


471 


40 


PGEGRFQQTRF P P FLP PWGP KGGPFFKK 
PKKKKRKEM I KLS * KPRQANQ * LGLLCQ 
/ T S QVVNAKE KFL KE I KNAT P VNTQRVR 
KLNSLIADIETVLVVGTEDQTSHNIPFG 
QSPIQINILAVFHSMKAEQSKEATEEKF 
EARS 


9946 


23847 


A 


10019 


217 


486 


KKI S * PVYLSLYLS I YLSVCLS VYYLS I 
YLSIYLSIYLCIYLSTIIYYYLSIYLPV 
YLS INLLS I / CLS P I YLLTYHLSSYLLS 
YLPTYLP 


9947 


23848 


A 


10020 


327 


1245 


TENQQGVLALAALS KIRMMPTRLKWFLN 
ID YTKFCEAFHVFKKRKKKKINKNI kql 
KMPLTKRIYLAFTEYSTQ*QQNTH\FFL 
SADETHGKIE /HVLGPKTRHNKFKR 


9948 


23849 


A 


10021 


329 


490 


GE S P TDNHCR S P KS Q VE KV IYTFFETES 
RS VT P AG V* WCNLGS LQP P P P \ GTSD 


9949 


23850 


A 


10022 


3 


394 


YRVLE VLG YMLN I RFWLGCSS CGQ AAQ 
FREGSRCTTACRHPAHTLHAAKMSRRKV 
S S AKCR * RETP KKRS ARWS AKPAP \ QS E 
TKPKKAAGKDKSSTSS\F3S3SSLtKAEK 
GGKEKQAEVADQETKDLSAE 


9950 


23851 


A 


10023 


448 


477 


S Q VF E T S LAS * MS KN I L / WAGTGAHT CN 
PSTLRGQGGQIT*SQVFETSLAS 


9951 


23852 


A 


10024 


198 


455 


S LKTNRRE I PED FHGQPCG KL VHS E KP R 
VRMGQT E S KYAS YL S F I KVI LRRGGVRA 
S/TRNLIMLFQTIEQFCLWFPE*GTLDL 
KDW 


9952 


23853 


A 


10025 


469 


26 


PPRAVQPPFLPSSSLHTCVPLCRTVFSA 
TSRPDSLLPALQVSAKMSPVL*SSLLLP 
S LGQGFPMAS I APRL / PHRS LRS C WAA 
AHSRAELDRHEAFLQCSLGQWPRNDFLF 
S FFRDRVLLCHPGWS TAA* S * LTAASNS 
GAQVMEFQ 


9953 


23854 


A 


10026 


222 


489 


KKKKKKKKFCFIARVKSNGAISAPCNLC 
LSGSNYFPASTSLVSGNTGACPLARVNF 
F Y*NL \ FLVKMGFHH I GQE S I 


9954 


23855 


A 


10027 


313 


2 


QDWE I S KL YS YCHS * VRRL / IFLKIRLS 
LYY I LQHSLNCN 1 1 G * LQFE S C Y I KLAG 
DKIAFRFLTLKTNHQLGLVAHAYNSSTL 
GGRDGWIT*GQEFKPSIANM 


9955 


23856 


A 


10028 


49 


388 


TPKI RGVFP FKGWFLGVFGAF P FPQKD I 
F YF P RT P MGTRLVS PGKKKKKE V* RVL Y 
GLKKAKKI FVLKVKFSHKRGFFGE I QFK 
KLFKW/HLKRNPPPKKKKR 


9956 


23857 


A 


10029 


394 


231 


GRQIASAQEFKTSWGNMAKLCLFI *RDR 
VSPCCPSWS* IPELKQSACLSLPKY*DY 
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peptide 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
K=Lysine ? L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine 7 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














RC/R3NTTTPGL*HLLFRCLLIJLPSYLFF 
CKSFKLLYMDI 


9957 


23858 


A 


10030 


265 


2 


HDNMRGSDYFPASNGLAVPFKRQVFTIA 
SNFLEKS PPRLAFT * LQVD* ERKWAG\ P 
VAHTCNPSTLGGQGGWITRGQEFKTSLA 
NMPKP 


9958 


23859 


A 


10031 


235 


413 


KAGKLFFFTE ITFKKGKKAGGGGSFLKP 
HLFGRPGGGNHRVR I L * KRGF YKMGRVV 
LKSLPCDSPPLASQKGGVSRKSPPPRPF 
CPF * RLFP * KKKAFRLF I PKKV* * ALV/ 
SLPFIGEGKTLFFFF*DRALLCRPGWNA 
VAPSGLKLSSCLSLLRS*DYKHVPPCLA 
MF*KFF/ CRN 


9959 


23860 


A 


10032 


167 


424 


GREFDLRMGHMVKPPLSFFFFPFSETMS 
LSVTQAGVQW*DITALSASWTPVISDSP 
VSAS RVAG\ IGTTGVRHHTQL I FL 


9960 


23861 


A 


10033 


252 


3 


VEGCLSSGVQDQPEQHSETPSLQKKIFF 
\*LAGHGSMCL*SQLLGRLRWEDHSSPG 
G* S CSEP * SCHCS PAWATE *DP I SKKKK 


9961 


23862 


A 


10034 


175 


492 


CPTCPFVWIPLCSEQPVQLYMWHTHTH 
THTHTHTHTHTHA/HLLFSLS FS FLRQG 
LSLS PTLWRS GMI / CGSTQS QSPRLKQF 
CSPRYSRG*GWRMA* /DPGGRGCSE 


9962 


23863 


A 


10035 


200 


448 


LYLHLVQLSPLDFPTPLFRVECPVSLSL 
CLS FS * FP * VCFFFLVSLGLF* GRVLLC 
RPGWSAWQSQLTAD/ SDFPVTiKHSSCIi 


9963 


23864 


A 


10036 


229 


463 


MILGISKISFSLAITLANFPPTLNLFIF 
LDTGSCSVAQADVQWHNQGSL* / SELLG 
SSDPPTSAS 


9964 


23865 


A 


10037 


164 


422 


GLSSQLTDGHLLALLRVGGSEVGSREES 
EREEAGREREGERRDSRGG*REREREES 
GAREEREKKEKRER/ERDREEK 


9965 


23866 


A 


10038 


1 


490 


PNQ I QNAVLIM I S FS I CFVS VI F FKSE I 
YNGQPKFKSFFFF*DESCSVAQAGVQ/W 
CG/LQSQQPRPPG\SSNP\PTSASGVAG 
ST 


9966 


23867 


A 


10039 


255 


446 


F FLRQ S C Y VAQARVQ / YAI HRC S HS T L Q 
LPALAS SNPSALIiAPRVAGTAGMQHHNQ 
L*YFFRIHS 


9967 


23868 


A 


10040 


157 


390 


DHTCPPPSTADYRLS * FLYFLLLLFRRS 
LALPPMLECSGAISAHCTLCVQETRETD 
AAHFKAARTIRAPHPAQLT IAFQVQTHV 

YQDGH/ dgs wwcdlrthpprs 


9968 


23869 


A 


10041 


290 


30 


TLKKRKEFSSVLCPRGYIVLLLLVHQFL 
FFQTGSCSVTQAGMQ/W/CDQSSLQP*T 
PG\SNNPPVSASQVAGPTGMHHGFLHTE 
YESRGPP 


9969 


23870 


A 


10042 


190 


929 


NHRY I S YLTRLVKI KNFANKS WLESLS P 
TLKHS * / W*CK* V* SLWKTVWHYLVNlA 
KDDLTFDPAIPFLMIYLIEMCAQVHPEI 
YARMFIAVLFEIFKHQ\NNPNKLWYIHT 
RESYKTIKIKE 


9970 


23871 


A 


10043 


457 


724 


HFGRPRQVDCLSSGVQDQPGQHGEVQSV 
LKL\KKIiAGHGGAHI»*SQLLWRLRHENH 
LNLGGRGCSEPRWCHFTPAWTTEQDSV* 
NNNNNN 


9971 


23872 


A 


10044 


111 


429 


KPDEDTTGRENYRPVFFTNTDANIiSKIK 
CMYPCISVH/HWMYGKIPEMMFT*Y** 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














*LFLSFFFYEMESGSVPRLECSGVISAH 
CNLSLPASSNPPPGFPR 


9972 


23873 


A 


10045 


30 


220 


LGVLYKKNSVFNKCYWDN*ISIWKRM\G 
LDPYLTPHTKINFKWIKDLNIITKTLIC 
CSWMKTKA 


9973 


23874 


A 


10046 


203 


361 


TYKLLVI F I VI DHF / LFS FLS FFFRDRV 
SFLLPRLECSGAISTHRNL*LPGSSD 


9974 


23875 


A 


10047 


198 


358 


IQRKCFTFFFFFEV* SCFVAQAEVQYCG 
SPG\SSNLPFSASEVSKTIGWILPS 


9975 


23876 


A 


10048 


270 


272 


RKNQRFFKIARKRLNKMTRISPLISIII 
LNVSGLNFPLKRYRLAEWTKKKK\DPI I 
CCL/QKTHFAG/RDrYRLKIKGWKKIFH 
TNGSQ*QRSEFF 


9976 


23877 


A 


10050 


256 


1 


GVTTNLYFWKNFLNPPFLVKKCENCP IN 
KVFP * KPPLLKKTLFFSKKKFCGV/HPT 
EKFFFFFFESLALSPRLQCNGTILAHCN 
LR 


9977 - 


23878 


A 


10051 


112 


359 


SKVSEEPTENEVEHNLKV*SFILPLKQY 
SM* * QKKNIHIH\ I S FYKK* ELTWPGMV 
AHACNPGRLGTEAGRSPEGQEFETSLA 


9978 


23879 


A 


10052 


346 


2 


RVSSYTLITLNLNRLMVALKRYRRLNGF 
F FLKKP * PN\ I CC I QKTNL I YKS TYS LK 
VKGWKNIH\HANGLKKQAEVAI \LFISD 
KKDFKS KI VKRDKEGHYVMTKGS I QQ * D 
RTIL 


9979 


23880 


A 


10053 


376 


6 


TRPSQHPDFILLNKCLLFEATPFLVICY 
SGHRKLRPVAPPIRGSKSQIIPMPPAVL 
PRWHLS CL S FR I P S S F F * LRL P RL CKS R 
/ S AWGPVS PHLPQ FLFFQAI S FFFFL * D 
GILLCHPGWSAVA 


9980 


23881 


A 


10054 


102 


347 


FLSFFVDGKLCLEAKIWVLAPAVFIVTR 
VLLLLSTL/RQVELRNIYLCSNVHIHFH 
LYFC I YLSAYILQTMS LY*HL* YHSNV 


9981 


23882 


A 


10055 


251 


51 


I S LVNWLF I LENMKLDP YLS PYTKINIR 
/ WQ * DLNVKNGTTKVLEEN * GT I YVMVG 
WKRPILACQQPKS 


9982 


23883 


A 


10056 


169 


342 


SQKQYSTCQNVFCFVLFLETRSHLV\TT 
LECSGAIMAYCRLDLPG*RHPPTSAS*V 
DG 


9983 


23884 


A 


10057 


1 


586 


AAARPAQGKARPGLIiLRRGRWELRAGF 
LTAKGAFR W WL S QKHVKMS YP P Q S RCG C 
GGMGAAGPPSLTVHQELCGAYSPDGTTE 
ALPLAS WPRARPS PAKAPAYDTAKLPAL 
IGCGSRRPPGVNPGASSLKPGACVSEGA 
GPTGTLESAGSRPPTPIjPPPV/ ccpgpp 
ARADH* FCHVTPQCR* /PPPRPSP 


9984 


23885 


A 


10058 


393 


43 


HARPGAECERPAEGEAARADAFSGRHAC 
PHSL YCRL * RGF F VHKLHLNKA/ RL I KR 
RKADNIKCWQRCGAPGTLMHC*WDYETM 
GLUjEQTVFS FEIRSHCAAQAGVQRNNHG 
SLQEF 


9985 


23886 


A 


10059 


228 


466 


GLLHGPPIYDLPEPVSTTPSLYHPPIVP 
*ASPLGPSAILTLPFHSPPPA\PLFSPP 
HSVLGSSGPRSLPT/SPTHLPTLTPH 


9986 


23887 


A 


10060 


293 


9 


DILQFYTFSPIQ\CPVYILHL*YISLQR 
L * FHQKYL I GI YF I E T WSCS VARLE CSG 
RITPRCNLNFPGPTYPPTSASGVAGTTG 
VCDFLGLPWSR 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9987 


23888 


A 


10061 


199 


428 


TAY CRT L P LKKSNP TISGSFFF FLE TG P 
RSVPG * RAGG \ DLMAHC S FDLPGSRDS P 
ASPRQWGATGPCHHPWLIFE 


9988 


23889 


A 


10062 


191 


2281 


LQTRGKLFKYKEKGICTKLHRSEIIIGK 
QWKEAVCPPTGQAQSQGGLDPKSREALR 
GGHGF VPGGGVCGHQEL * LHSDGLKAVH 
PAGSE PSD PGGVQAAACQDAATGGTPLQ 
PMPE TARGC / TAT AS S RE AG PGMDP / RE 
A/GPEAWQLGKARSSWGGSSTSGGPAWV 
FIRDTPSWWGIPGTVSPPAGPRCPPPTR 
PQWEQLGSGRPG*MQTTAEETNAAPRAG 
QERTPSRGPSADPGVSQHVLSHRRSCPK 
CERRSVPPNG\AGPTLWTELHPSDASVS 
ALT PL * GDVEVGPYG\CEGYI KPGRRTH 
QHNWLDLDLRPAASRTCRKMLELPRV* F 
L LRLP ELAET YS S T P P ACAQRP RGTGGG 
/ PPQRCVLMSGSHQGGAS * KGRDAPSLG 
SGLERPREGGPGWLS /RQPSPHAVQPGK 
HMLCLGPAF*RGRGARGVPVPVRASHFP 
FHGAAWWPGDHMS Q P * WQMLSRPARGHR 
P S WQVRRQCTVLGALKPGLPDLKCPGP T 
AFLPG* GVS /EENGQ I PNVSTQL YLQNL 
PLPREQN*RRFPRDLSSGGAGPGTG*CG 
GGGGHALSPTPSRRLRAHSKDNSWAQPL 
CCWMPAASGC\PGIGCNGVPPVAAS*QA 
AA*TPPWSEGSLPAG\*QPSAHRCVTRV 
PGGRIPHLLAQAVPPDDRAFPNCQASGQ 
LLGVHSRPSLSAAGCRQPLAVSGIGCNG 
VPPVAAS 


9989 


23890 


A 


10063 


65 


453 


ISREFLFLETKSLYIMLKGSIQQEDIIF 
VNIYTLNTGAPRY/ IK* ILLELKRQIES 
DT V I P E DDNTP L S ALNRP S RQ K / VDL I C 
T I EQMDL I D I YRT FH P TAS E YL F F S AHG 
S FSKKY YMLGHKI CLETFLN 


9990 


23891 


A 


10064 


100 


466 


LGILPLQSRPSGPGGQCTPTLPFWSLP 
L P VRGCFLHLCWCLP YSGALGS P CHTGL 
VCWHI PCF * CCGVS P *R\S CLE VHLYTH 
THTHTHTHTHTRFVLM*MRTPLRKGKRQ 
VMRL CGGGGRA 


9991 


23892 


A 


10065 


226 


377 


GLAGS PCI SGSVS FCSGCI Y \RGHGIMF 
I CND C KYFRFCKS K * NCFKEHN P 


9992 


23893 


A 


10066 


292 


458 


KVSS I CLKMS FLFLFFPRTGSHCH * KVS 
S I CLKMS FLFL FFFRTGSH/ S S PRLECS 
GMIIAHCSLKLLDSSDLTASAS*VAG 


9993 


23894 


A 


10067 


252 


36 


TGFNPLHFIYYYYYYYY\FESKSCSHAQ 
AGVQCCDHGSLKPQP\PSSSHPPTSAP* 
VAGS RNAHQANS TAW P MVS 


9994 


23895 


A 


10068 


56 


481 


G I LKNCHNLMI KLE RMRS C VLMGE QRK / 
W I Y * DE S T PGEDT V3STT VE I T I ND LE YF I 
NLVEKTVA/ GVETVDSNFERGPTVGKM\ 
I VC YRE I FHE KKS P S MQ QL \ YF * KF PQ P 
LQPSAATTLVS*QPS\SKQDPLSARRLQ 
FTAGSD 


9995 


23896 


A 


10069 


263 


435 


HFSFLSFFF*KTESHSVT/RVEGRGGIS 
VDCKLCRPGSRDSFASASRVPGTKGISQ 
GRG 


9996 


23897 


A 


10070 


116 


474 


RGAGAWGVSGRSSPPLSALPIQPPPHLS 
TPPPLETP/P PVLH * PLKDLGGWGALLA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyialanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
KHLysine, l^=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJnknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














Q*RLPG*HSPKKKKKKKKAPRPPYK*GP 
PKTPVDPPDGPFLDPPLFGWPPPP\VFL 
PGERPPPL 


9997 


23898 


A 


10071 


159 


377 


KSHMPLTLNQKLEMIKLSEE/GLMSTAE 
IGQKL/GLLLPNSQWNAKEKFLKEMKS 
TSPGNT* TI I KWPGAVADA 


9998 


23899 


A 


10072 


180 


473 


AIEIQLLSRPLCLWPSGRFPDAQVANIR 
VRVAV * GNPLR * VGPAE P GE QR P S P WGF 
PSIS/WDISPTWSSSASPPGLSADCKFT 
CHYRCRALVCLDCCG 


9999 


23900 


A 


10073 


98 


440 


G QLN KXj S GP Y P ENVG YT L P VL INPLAQ P 
VIYSTIFAGTLITALSSH*FFT*VGLEI 
NMLAF I PVLTKK\ INPRSTEAAI KYFLT 
QATAS I ILLIAILFNNILSGQKKKKKGR 
PF 


10000 


23901 


A 


10074 


358 


404 


P CL YT YHLHNSNST * SLRLVS IMIEIVT 
SNLHRTVPHSYTPPTQLS INLVMAIHLS 
AGAWI ISYPSMIKNALVHFLPQGSPTPL 
IPILVIIETISIiLIQ\PRPLAVRLTGNI 
TAGHLLMHIj IGS ATIiAI S T INLASTL 1 1 
FTILILLT 


10001 


23902 


A 


10076 


3 


419 


KWRKKMWCLQKMEYYSS*KGKKLLSHVT 
I WMDPEEVMLS /E I S /SVTEGQ I LFNST 
YV 


10002 


23903 


A 


10077 


335 


414 


DKSPF/D*RFFLPPNSNGFLVKMPPSVK 
RL.PFFFIFPFSFFFFETESRSVT\RLEC 
SGTGLAHCNLRLPGSRDS 


10003 


23904 


A 


10078 


279 


1 


KDTQLPFIQFFFETGSCSVPEAGVQWCD 
HSSLQPPPP\GSSGSSPLSL*VAGTIGM 
LIFKI FSRNPLNFFFFLRWSLALSPDWS 
AVARS WFT 


10004 


23905 


A 


10079 


397 


3 


KSSSLFQNHLWANSSTYSHLIPTLGYIL 
NHGNQFDPDTL/RDKM*NFFCTIVCPHY 
ELPSLEQW/VSLGSLNYDTILPLDLFCK 
RQGKWSEIPYAQDFMTLYQNLTICQT/P 
QNPPPPPKESSKVELDIIDDPPH 


10005 


23906 


A 


10080 


281 


1 


I FLGEWGPFFPPQKKSFFPKI PQWVFFT 
P PYRKKI FFFLPR * NWA/HPKI FFKRP P 
PFFFFFFFFFFFGFKENCFILLPPKEHI 
TQAPAWG FAG 


10006 


23907 


A 


10081 


207 


35 


Q ENRR C * RGRGE S / GTL I HGWRE CQT VQ 
PLWKTVWQLLKRLNTEFPYDLAILLLGE 
FH 


10007 


23908 


A 


10082 


2 


408 


IAPLHSSLGDRARLRLKKKKKKGKIRPL 
LALFFKNQNPS FKKNSHSF YGALMEP \ P 
PPHIPPFFFWKVQPPLFPFRGTLEFPGP 
IGTCKLQTLGSPG*WDPPCPLPHQSPGR 
GGL I PGRKGEDP PS FRGLP P PHF 


10008 


23909 


A 


10083 


195 


2 


ECKQRPQLEMVSFRRRSHEPFFFETGPR 
S VAQAGMQWH / DL S S L KPP AP G \ S GDL P 
TSAS*VAGTI 


10009 


23910 


A 


10084 


219 


1 


RLKIKLLKIIITTTIDSGICSFKSCKL* 
H/LKIQNMWkDKLAHAYNPSTSGGRGRQ 
IT*GQEFKTSLGNMVKP 


10010 


23911 


A 


10085 


364 


413 


KKKGGEKGPLLKKER* KTNGQFLVHTNF 
/RIiPGLKYFF/CPPPPSKWGLRAPPPKP 
GDFFFFFFFFLVFLVETGFHHVGQAGLE 
LLTS * P I PLGFPKCWDYRRE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G— Glycine, H=Histidine, I=Iso leucine, 
K==Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V— Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10011 


23912 


A 


10086 


200 


3 


I NF I LLL F I F VLL I F S TG PCS I AQ AT VQ 
WRDHGTLQPQPLGLKQAILP/ASAP*VP 
RITDVHHHARLN 


10012 


23913 


A 


10087 


154 


329 


F S L S D F KT Y YKAAGHRGS CL * YQHFG KP 
\RRTDPLRPAVQDQPDQRGETPSLLKKK 
KKK 


10013 


23914 


A 


10088 


157 


390 


NSSSFFP PS CGAQ/ KSEVKKS SGPHSS S 
SSG*RGE/ CLPLLVSGGSWHFIiAYGSIjT 
P VSACVFT* P FSMS LCVYIQ I S VS 


10014 


23915 


A 


10089 


305 


1 


KKKRAQSLPGQKGGRGFSQPVTAGDRSF 
RP K I R KE ALGLGQ LGIiTDL CRT FP PTAT 
G*TFFSSAPFPG\YPPV*PTKKKNTNLN 
*FTRIKI ISRIFSGHEK 


10015 


23916 


A 


10090 


329 


2 


RGFFQPLGFWDPGP ISNLGGPPKNLLLK 
AEIMGYNFCVPPFPPGGLPTTHFFFPPR 
NPPF/CFPFTPSLGFFSPNPPGFFFFFF 
F FL S R VS L CHLGW S AVA* S WL S AAS 


10016 


23917 


A 


10091 


84 


408 


E KKGENKLFWGFI1LI1GTLF I GGYRFKI I 
FCLVGY FIFIYL FGEME / FS LLL PGL * C 
NGTCSVHHNLPLPGASGSPASTSGEGGI 
TGMRRHPRLY* I YLGRWS FAL VARAVMQ 
WHVLGS P Q P S PAW C KRFS CLNLRRGW Y Y 
RHAPAPPSLLNIQKI ILGFWLAPVY 


10017 


23918 


A 


10092 


184 


359 


FQ INS S VTRRS F VIiAAQAGVQ WRDLGS P 
LLPPP\GSSDSPASAS*VAGITGLHVGL 
SHK 


10018 


23919 


A 


10093 


298 


386 


KLiCEK* * ITPIRMAI I * KKREGENS \KC 
W * GCGE I GTLVYCL * E YKMVQ PL WKI VW 
QFLKILGIEL 


10019 


23920 


A 


10094 


310 


1 


ICMNEMILYKQDFGRTIIIRKIVIYFWI 
FN* FGH* I R I Y *NLLR YKTCML * RG* * I 
NLPQRSFKNLRLKI * * W/ W P GT VAHAYN 
PNTSGGRGGRIT*DREFETS 


10020 


23921 


A 


10095 


40 


407 


EHTHTTACEPGGI* IRPVD*MVLISWL* 
HGTITCKMVTLGETGQKPQGISLIIIFF 
FFG\TQSLFVPRNGVHCRGPISAR*KLC 
LPGPPLSPT*AS*VAETTGACYCTWFEN 
NFFLRLPLHCY 


10021 


23922 


A 


10096 


189 


2 


P P FFF FSNLFKKS F YEQGLALS PRLKCR 
GiriA/HCS/li*TPG/YK*SSCLSLP\S 
SWDDRCAPPRV 


10022 


23923 


A 


10097 


260 


401 


DHLRSGVQD* LGQHGETSSLLKVQKLDG 
RGGLQL I P \ RRLMHENHLN 


10023 


23924 


A 


10098 


187 


464 


W YMTR CGS LG PS S AAHE S HH P P P QQLMN 
P\PSPSSAAHESHHPPPQQLMNPTIPLL 
SS*IPPSPSSAHESPGRVGRPEFTGGAH 
GYLHLSPP 


10024 


23925 


A 


10099 


397 


1 


FFGGAPCPPPRRKGKTPPPPRKLFPPFF 
PKKPPLPKKNC*NKNPLLGPKKKNKRPP 
PGRFFKKGFF*NPPKAPLTNFWEKKKKK 
\SPFFKGVFFPPGEPKKPPPFFFFFFFF 
FFFFPTRPPRVRPRVRPRV 


10025 


23926 


A 


10100 


253 


3 


NAINRNKMQ * TKTS E WKVYADHNAT I KQ 
KQKRKCQVL/WRGCGETETLVACWWECQ 
MEQPAPVEN/W*HFLENIKIQLPYDPAI 
HE 


10026 j 


23927 


A 


10101 


121 


404 


VLASWS I FFFFFF WKGGLI FS PRLEGG 
G/LILG*LKPPLSGLGPFSCLTLPEGWE 
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SEQ ID 
NO: of 
nucleotide 
sequence 
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peptide 
sequence 


M 

eth 
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NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIan ine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaJanine, 
G—Glycine, HNHistidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LKPPPPDPGFFFFF*GKTGFPRGGQEGL 
NFLTWEPPPLAL 


10027 


23928 


A 


10102 


468 

• 


1829 


IRDTYTQNPDKAKGYFSC I FFE I EETTM 
SRKQNQKDSSGFIFDLQSNTVLAQGGAF 
ENMKEKINAVRAIVPNKSNNE I ILVLQH 
FDNCVDKTVQAFMEGSASEVLKEWTVTG 
KKKNKKKKNKPKPAAEPSNGIPDSSKSV 
S I QE E Q SAP S S E KGGMNG YHVNGA IND T 
ESVDSLSEGLETLS I DARE LED PES AML 
DMLDRTGSMLQNGVSDFETKSLTMHS IH 
NSQQPRNAAKSLSRPTTETQFSNMGMED 
V/RPRHQ* KS * VP I FEKS VKDLQRCTVS 
IjARYRWVKE E MDAS I KKMKQAFAE LET 
LINGSRKWALLAEMDKVKAEAMEFLLSR 
QKKAELLKKMTHVAVQMSEQQLVELRAD 
I KHF VSERKYDEDLGRVARFTCDVETLK 
KSIDSFGQVSHPKNSYSTRSRCSSVTSV 
SLSRPSDASAASSSTCASPPSHTSANKK 
NFAPGVY 


10028 


23929 


A 


10103 


254 


2 


KKKDFPLTPLGFGGLKIQLFFFLQKKKN 
FYFPP* XFPKIFFFFFFWFGTQSCPVT\ 
RLE C S DR I P AH YNLHLQS S S H S RAS VS H 
E 


10029 


23930 


A 


10104 


256 


419 


KVIELHTGR/ CAVGQHVGRPRRVDCLS S 
GIGDQPEQHGEPHLY*KNAQSSQAWW 


10030 


23931 


A 


10105 


153 


439 


TT VT S L C I Y YT I Y Y Y FRLL S THAYTH I Y 
THIYV*KLTIKQPQAGPSEGF*KKEF* * 
RRW / W VGT VAHTCNP S TLRCQGG WMT * G 
QEFETSLANMVKL 


10031 


23932 


A 


10106 


424 


157 


S SHLGRPRWVDHLS PGVPDQPWQHGETP 
S L /L KLAGGACL * S QL I GRM / RL S S GGR 
GCSEP*SHHCIPWETEQDPISKNKQTI 
FRMFKN 


10032 


23933 


A 


10107 


213 


3 


NYKCPPTPNLNTLKALYFKKLILSPKA* 
AIKEKID* /IIKIKHFYASNIINRMKRQ 
LTEWKKIFANHVSDKG 


10033 


23934 


A 


10108 


300 


1 


TPKQRGKKGSPREKPFPVNLPGKTLF/Y 
EV* FMKP L K* Q KKKWFQGFFF FFL I CGT 
VYHQFTSK* ERG VPGT VAHTCNP STLGG 
RGGWIT*DQEFETSLG 


10034 


23935 


A 


10109 


120 


421 


KKI KKKNKNHM I X \ S IDAE* AFDKIQHR 
FMIKTLNKLGPEGKHI / KIMKAI YGRGG 
QDGQLHTAKK/EPLPLRENKISGKP 


10035 


23936 


A 


10110 


102 


418 


TLVCVCVCVCVCVCVSLSVFLSLSLCLS 
QAGVQWCDHSSLQPRPSG\SGDSPTIAS 
WGGITGVHCHIWPTFYFLFLIRLKSLM 
L**TSSEMALTKMGVHSHGLFN 


10036 


23937 


A 


10111 


382 


2 


VNNATMPSKCSTEKSHMSLTSNQKLEMI 
KLSEKGMLKAKIGLKLGLLC/QVSQWN 
AKKKF*REI*NPTPVNIR/DMQKVLV/V 
KTEDQTSHNIPLNQILIQNKSLTLFNSI 
KAERSEEAA* EKCEDSRG 


10037 


23938 


A 


10112 


61 


427 


ENTIYQVQCSIKEEYLQRRTLKYASLFQ 
KH I CGS LAFLHLL * PKLYHSRSNAE SD W 
1 1 QLFSVQ PNVKE I S KHKIGW * RGHACN 
/RQHFERPRHKDHLRLGVRDHPCQHGET 
PSLLSLLKIHI 


10038 


23939 


A 


10113 


71 


399 


NVLGKEARSVGWEAVRGQISRVS IQMKW 
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beginning 

nucleotide 

location 
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ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=PhenylaIanine, 
G=Glycine> EMHistidine, I=Iso leucine, 
K-Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine r 
T=Threonine> V=VaIine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














KGLIKGEKLKMEKSQACICIYFFFFETE 
TNFIAQAGGQGHDLGFMEPPSPG\*SDS 
PA* APHEQNGL/TGMHHNPRVI FV 


10039 


23940 


A 


10114 


362 


1 


SKKKSWGSHSGSKCHTSFLSYPTKEQKS 
YLSLFYKCNLQSLTFKELLDHFRPVCLP 
TIY*NKNWP/RDLPLKPSFKTLML*NQT 
L/WPGWAHACNPS ILGGQGGWITRGQE 
LKT I KANTVK 


10040 


23941 


A 


10115 


314 


3 


WRERRCGAHLMRCASNMAHDKP SRMLNL 
* KPKMAT/RNTDHDNFW* ghgetei lnh 
YWWDYKIM*PLWKTI*QFPKMFNIKLPY 
VS YI QLLGI YPRNLjKINIHTKT 


10041 


23942 


A 


10116 


44 


394 


PGFQRVSQDGLDLLiTS* SAHLGLPKCWD 
YRHE PLRPAG I YS YPAVLFVLS T YRASN 
IT*MLMTPKPTSG\YSFLLSFRFSSFSY 
LRD I S AYMSLS CLKLNVFKSKP I IFLS P 
LWLLV 


10042 


23943 


A 


10117 


398 


3 


MESCCFMGIEFVLKEEVLEIGCRTM* I Y 
LHY * S LHLRMVKMVTSLLC VF YHN* TIF 
KKS S VFLYAGNKQL * I * I LKNYI YVMI P 
FT ITSRNMKRNNLTKDGKDMSTETYKTQ 
LKE I TD \ * NKWRDT TFMHQN 


10043 


23944 


A 


10118 


104 


386 


KVPVWATCGVKMHRLAGHGGWRIiWTRLP 
WRLRRDNRLS S GS RGA\ AAE I LP LCH C T 
PTWVTEHDS ITYKY* KKGAP CWRASS * C 
TLSGHALFFFA 


10044 


23945 


A 


10119 


258 


401 


YNYPFLTCVMKEKIGAGMAAHACNPS TP 
GGRGG* \TRGQHLETSPTNMV 


10045 


23946 


A 


10120 


310 


384 


GVFP FLTFLYGG I KGGP P PPRFFLF * FA 
FLFFFF/ETGSHSVAQANCSGSVSAHRS 
FHLPGPGDPPTSAS * VTGTTGICHHS 


10046 


23947 


A 


10121 


245 


479 


TSSLYRKIQKRFLYMLALEVLKICNTYF 
MNTERFPLKLFGQFS /FRI IS I * FFPFL 
FPKI YHYFLRDRV* LCHPGWSAV 


10047 


23948 


A 


10122 


268 


469 


NS VF WLGWAHA/ W * VN P S TLGGRGGW I 
TRGQELETTPANILPP 


10048 


23949 


A 


10123 


221 


423 


GQAGAVAHACNPNALGRPRREDSLRPGV 
GDQPGQHSKTPSL/liKSDNF*KMGWLGP 
WAWWLTPIIPALW 


10049 


23950 


A 


10124 


73 


183 


AGGGWALTNTVINPT\PPSTPFTPCLSY 
S CT I S TASAFRF PG \ S TS TAVLTHVR I L 
KSTPDAPHPPLKIPRARRNIQRDHTLSS 
NLFYYNHQLNVTEELTYYDLT* INPTAT 
IHAIHTMPLLFLHNLNCISI 


10050 


23951 


A 


10125 


342 


2 


PNLGPCPPPRGTKGGPFFPSFHPSI\IG 
PPNFFSF*RP*FF*MLKSLGVP/SPTGP 
I PKFPFFPGFKNYFS PPL\ PKGPLLKPP 
RALFFFFFFNRHRVLL\CPGWSPTAGLK 
RSSC 


10051 


23952 


A 


10126 


85 


442 


KKVINI KCN* RVWRTGSL IHCS WG * KMM 
PHWRAVWQSFR/SLSMYLSYEPALPLPG 
SYPRALRTTCVCTHSRCFSLLELPQA/W 
NH 


10052 


23953 


A 


10127 


180 


484 


KS FFFFSQSAQP I KVI PAP *NFHFFGVK 
RSSCLSFLRKWGPRWVPPPPFFFFFFFL 
I FVEMRSHYAAQAGLKLLASS \VPPPSG 
IPRTLD*SYPD 


10053 


23954 


A 


10128 


376 


1 


WSPGAQLLFISCLYYNLWSLHTGRTPAK 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PP *GWTLQKTTPLS CPAP* LPPPLPS PL 
QQIDLKEKLVFTAPACSL/PHSPNSSPW 
PQRRWPS VS / P / P VAHPAAPRLPPP P VC 
ARPLWLPEAPLLG I PPH 


10054 


23955 


A 


10129 


140 


454 


CLIRSSINSVHNNIYIHSTPDRHLGCFQ 
FL VVT KNAAVS RLAYVF * * I * IPTYDHI 
HTHTPAG Y I P RHG I AG S QL TN I FNFS S * 
LiSSNSF\QVGCTSYFLFPSHI 


10055 


23956 


A 


10130 


340 


473 


AGARHSVLRPRPPLAGSSCPSS/CGGHC 
VALGVSCPGGGCRAACVPGCCCLLCGLP 
SDVCVC/SVCWGGSMGSRLGGWAAPGF * 
GNSREGPGNCSSGRKAGLAGGGLGPAWA 
EQAPDILCCGPGPHWLGAPAQVPLAPPL 
SPPPCLYGCLYSDRGDL 


10056 


23957 


A 


10131 


280 


2 


DSIALLPRLEYNGVNVAHCKLKSSWIRR 
SSHLSLPKHLDYRSMSH\LPGLEIF*YS 
YCK*CLLTSPYFEFLMNSLEDISSCFIP 
IRVMPKCRMK 


10057 


23958 


A 


10132 


244 


3 


IIS TDL I SHSWTLYL I S S L VY VFLKQHS 
IiAVLPRLDCSGMIIAHC/KLQTPGLKR/ 
FL / CLS LPS T * D YRHTP PNRANF YY Y YF 


10058 


23959 ■ 


A 


10133 


272 


3 


VGKPRVFPYRIAFNFGRVTS I LGPH I KK 
YHFQCRGGKHCFF* KAPPFFFF\FNETG 
FCSVPRLECSGTI I ALCS LNLQGS CNS P 
TSVSQV 


10059 


23960 


A 


10134 


217 


3 


FPRFRPLVSPALTELGQF/HFLTRGL* K 
KKPFPFLFFFFSETESCS IA\RLWGSGM 
I S AH CNLCL PGS S RTRG 


10060 


23961 


A 


10135 


229 


3 


KNDYLKCWQGCGRTETQMLGCWDIK/LI 
QP/LWKI IWQFLLKLNMHLLGNPTLPLL 
GIYP*EMNVYVHTQTCICTDAW 


10061 


23962 


A 


10136 


84 


352 


RREFKTGLATWQNLVSTKKYKTTVRYHL 
TSTRMDI I KGKR * Q VCW / RKC / GDKGTL 
LDC/WMRMLVQPIWKIV*VWKFCRK*KL 
NLFFPPPPQ 


10062 


23963 


A 


10137 


280 


2 


KCWVPEERI LRE KS QHLGKI KQEGRLN 
AWL I Q PGQHGE TL S LQ KI /QLAGHGGA 
PLR S QLHRRMRRE DHLS PG VR S RS E P * * 
LSLHWVTEQ 


10063 


23964 


A 


10138 


2 


386 


HI QE KQ L PNKD S TLNP YFL L I S I L KRKD 
EKIQMPSKHMKRCSTSLVIREMQIKIRS 
HFPH * VGKI LKS AN I KRW * MCRKRG\TL 
ITCW*TYK*TYKLVKTF*RIILQHPLLK 
THQSYYPAIPHLKETQA 


10064 


23965 


A 


10139 


229 


1 


TRSYHESLTILHTNHHLNCSPFPQAITP 
*SRFPYSNRA\WLGSVAHTYNPSTLGGR 
GRRNA*GQEFKTSLGNLPK 


10065 


23966 


A 


10140 


261 


429 


LSHLFHVALI * LGLVW/ SFVFVFFETGS 
LPRLECSGVITAHCSLDLLGLSGLPTSA 
F 


10066 


23967 


A 


10141 


377 


34 


WVFGSSGPPPPGFKKFFSPPPPKNKNPR 
GPPPPPIFFFFFCKKGVSPLFLPPGGGP 
/HPFLSPPPPGVKPENFFDPGGGGFP*T 
KIFSRPPP PGGKQNF FFLKKKKKKKEHQ 
IKR 


10067 


23968 


A 


10142 


208 


424 


FFSLFMQTFHIHIRFYFFLAGGGGAGVD 
RVS / LLSPRLECRDTISVHCYIHLPGS K 
QFSCLSLLSN*NYRHAP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyialanine, 
G~GIycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine ? S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10068 


23969 


A 


10143 


1 


297 


P TR S P Q VS NLNH KS F I S FY L F I DLL SVC 
LSVSVYFETSLTLLPRLECSGAITTHLS 
LLSS*GHRHAPPCQV\FFVETRFHNVSQ 
SGIELLSSSHLPTPA 


10069 


23970 


A 


10144 


185 


488 


SGFPRGTEL I GAVC VCVCVCACVCVC I K 
LGYI*NHKRIYI*LLYIYSTYYIFNLSL 
* LS VY\ S YVNRNSM* DRSQS I * I FILPK 
LKTCLENTHTLKNTHEGG 


10070 


23971 


A 


10145 


283 


3 


LGWPGRKADGSGKLTIDYGRLDKLVSP \ 
I *SAISDMASTTKAVLQAQRDQYSVLDL 
AHAFFSIR\NQSQFSFIRDSPQYAFTLP 
SEEHLWSLDGQ 


10071 


23972 


A 


10146 


146 


380 


CFLWDNFFFFIFTEKTGSCSIAQAGAGT 
Q/WVGIISYCSLKLMGSRDPPTSLSCVA 
GTTGMPHHAQVI YFFI FL * RQGL 


10072 


23973 


A 


10147 


275 


3 


I KGMLKAKTG* KLGLLCQ/ TSQVENAKE 
EFLKEIKSATPVNT/RMIRKRNCLIAEM 
EKV*VTLIDQTGHNIPLSQSLPQSKCLT 
LFNSKKAER 


10073 


23974 


A 


10148 


281 


2 


KI FLPQP PQEMVFQGFAPPGG* I FLVF * 
*KQPFPPLGRFVFK/LPGGIPFFFFFFF 
FFFFFETESHAIT\RLECSGTISAHCHL 
RLSGSSDPTRP 


10074 


23975 


A 


10149 


413 


139 


SLRVWPIFGLLQANKKKFFWILFLS/IR 
PLFFFSRERGGTPLFPPFFLRNPGKKSF 
PPG*PPPPLKIQFPGKYFFTSRPAPFFP 
LFCQPPGKNLNYWGLGPGFPKIFPPPPL 
FFFFF*DRVSLCRSGWSKVARS*LTWGK 
AFFTWVS 


10075 


23976 


A 


10150 


100 


431 


GPRL TDHLRSGVPNHPGQHGETS FLL I / 
HNYSGG*GRKIANRLNPKGGGCSEPKSC 
HCTPSWGKKRMFVS KKKKKKRE 


10076 


23977 


A 


10151 


134 


417 


GLAAPLVG/ WGKRNP P P * PFG * GEKTGW 
GPHPGKIFLKKKLLFFFFF*KGGLSFPP 
GGRERAHFWLKKTP P SRKKE ILPPPPPR 
RGGGGPPPPPFL 


10077 


23978 


A 


10152 


403 


2 


LAFP I LQE VPQCFCNLLHSLRMYS STGS 
KLFPLASPQPHSLIiYHLTLARAPlSriLEE 
LQ S PNTALLLK YL * E AALKKKS R * H I T S 
SL*KVDLSFFLETKPRSIA\RLKCSGTI 
SAHCSLRLLGPSNSCATASQVAG 


10078 


23979 


A 


10153 


382 


297 


GLN*ESWSGEKGTL/LHCWWECKFVQPL 
WKMV*RL 


10079 


23980 


A 


10154 


209 


15 


SSPKF*FFFLSPKKIPLPFF*PIFFFTK 
TP/HFFF/SFFFFVCVSGSHSVPRLECS 
GAITAHCSSTS 


10080 


23981 


A 


10155 


153 


422 


RTSGKNGNRGQI ILVTMVTNWLFS FFFF 
PLKTGPHFVIQAGGQGRAHGSLQA* IFG 
\ SGDLLALVS LQAGTAGFHHHAQLS FFF 
KKTGLF 


10081 


23982 


A 


10156 


257 


3 


RRLQNSKDCCLFFPLEAPRGATARCQPE 
LSCMRCLSA\LLGGVSQSGYMGVRDPHK 
EAV*PLAELKR*AGRSSAPFKAVRQGCL 
S 


10082 


23983 


A 


10157 


37 


386 


DATRFRINS VGTAEGGLDRI E PNDLQ * Y 
INLVGTAVRRPDRIESNFESSTVGKKLS 
NNIVCYRSFFCERESQLTQQTSMLYYFK 
KLPQLPQPSAITTLH/SRQDPPSARRLQ 
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amino acid 
residue of 
peptide 
sequence 
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ding to 
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peptide 

sequence 


G=Giycine, H=Histidine, I=Iso leu cine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine 7 V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X~Un known, *=Stop cod on, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














LGQ 


10083 


23984 


A 


10158 


407 


272 


HKGGKRERKRKKKEKKEGGREKGQERRK 
ERE/RERKKGRKREMKKVNRYSKCTQ* * 
LLSN 


10084 


23985 


A 


10159 


248 


478 


VKWHRLLDDPSVEDPWKKSLKGPLQP* 
/RL*IFFF*NSGLLCHPSWRAVAQSRFT 
GVSTSQAQAILPSFSLPSSWDR 


10085 


23986 


A 


10160 


458 


129 


GFSVLNRLEDEDLRQENLLMPAV*DQPG 
QHSKTPISTKNNMNPAGHGGVYP*SQLL 
RTVRPEDHLY/ SNKPCSYLCAPAWATET 
LSLKNAFS CCSNHYGLS FQTS FHR I LMF 


10086 


23987 


A 


10161 


82 


291 


YS KTKSL FFAGHGGTC I YSQLFRRQVQV 
RQEDHANPGV*GCS /DTMIMPMNSHCTP 
AWAA* * D P VS KKKKK 


10087 


23988 


A 


10162 


6 


409 


LRINILPEPISLSPLTAPAVFMPWTNYS 
SCDTFSLASMTLLPIiHFLSTSIiALPWSIi 
FLAPS L FQ I S KHQL YLS P VLLTLS S CY F 
LNPCFRNHILMNLGRPRQANYLRSGV^D 
QPGQRGETPSL/LKNTKITWPWW 


10088 


23989 


A 


10163 


249 


415 


APPECSLPLLVLLLLLLLLLFLRWEDHL 
GPGG*GCSEPKSYHCTSAWVTR\GDPVS 


10089 


23990 


A 


10164 


1 


405 


RSEWRLRQPERQSETVKKKKKGFFFFSR 
GGGGGKIFGLMEPSPSGLGQFFLFTPLE 
NGGYRPPPLGGPFFFFFKKGGFPLCGPG 
GFEFPALGTPPLLFPKGLNFRGGPTPPA 
RFFFFWGPVFSVFS PKN*NPPVFRGKNG 
KNRPPKKKKPGRG\WGPPRKFNPFGNKR 
GGVPRAGNS KPPGPQRGNP P FLKKKKKG 
PPKGGGR*PPFSRGVNKKNCPKPEGEGS 
INPKIFPPPPPREKKKKPFFFFLTVSLC 
RSGWRNLHSL 


10090 


23991 


A 


10165 


175 


404 


AIKMVWIQAVPSQCLHGVGFYLPRSVCV 
KTP*NKRHRQGWAHGCNPSTLGGRGGW 
I \ RGQE FETS LTNMVKP 


10091 


23992 


A 


10166 


317 


2 


P FKNS PLLLNEMTN IHFLEPTCNYVNVS 
PFLK*HKFLFF*EMESCSVT\RLECSVT 
I IAHCCLKLLGTSNPPTSASWAPGMI IG 
MRHC PAP KVC S HAS AHAS AHAS 


10092 


23993 


A 


10167 


325 


2 


KMGEKQGHFIKEEIRKANKHIERCSRLL 
AIEEM*IKATVTI/RKHQTKC*QG*RES 
GSLVHG* *KHKIVHPLWKILSVSYKTKH 
SI IIIGYSSVLLGTVLVHSGSDAW 


10093 


23994 


A 


10168 


245 


3 


GEPFLFCFPIKKPKVFFCPYFNQAVFFL 
ETHRGFFEFFFLF * KKKKKS / LGMVAHT 
YNPSTLGSRGGQIA*AQEFKT I LGKTMS 


10094 


23995 


A 


10169 


164 


1 


MNKMS Q YT F P YKN * QWGG \ S VAH P YNP S 
TLGGQAMQ I T * AKE FETS LSNMAKP C 


10095 


23996 


A 


10170 


214 


419 


LS * PP PP FPGFKKFNRVLRF YGKS KPGP 
/FMFVALFQKGGPFYQNIPPPVKLEPFP 
PTII i RVSFFFL*DGWLCHPGWNAVA*S 
QLTASSTSRVH 


10096 


23997 


A 


10171 


168 


1 


WTWWLFPGFS ITNNYCSHFSSLHLAIW/ 
LGTVAHACNPSTLGG*GARGQEFETSLT 


10097 


23998 


A 


10172 


139 


3 


PAH/CES*PPGLKRSSYVSLPSG*DYR/ 

htpph\qlifvfwemgfhqpa/sa/fp 
kc*dyrs\dhhkwpt*hlk/ sfhfikk* 

N*LKRYLACSRLWYLFCLIRCMV* *NLD 
DLSILYLHLNKFQFLFIYLFFEMES 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A—AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GIycine ? H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
. N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop cod on, /^possible 
nucleotide deletion, \-possible 
nucleotide insertion 


10098 


23999 


A 


10173 


253 


2 


FPLKICNTLPPQIFSTQRGLLSLPFTIF 
LLSLIIjLRPFFFFF*TESHSVPECSGMI 
SAHCSLRLPGSSDSCASAS\QVAG3CTCA 
R 


10099 


24000 


A 


10174 


300 


3 


GRAF FFKGLVKWVFLT PHP I PFLLKNFL 
KRVFFLAP S EKFVFFKNWP P * LS FL I FG 
EKNFFFFFFFFSEMESCSVA\RLECNGM 
ISAHCNICLPGSSDSP ! 


10100 


24001 


A 


10175 


117 


381 


EILRYIVSIRIRRTNLKFQQEELRVNIK 
NNLSSLGNMVRPRL/ECSGVITAPTAVS 
APGFK*SSHPSLPSSWDHRRAPLHLGDF 
FL I E TR SHH VAQAGQ VI FNVYS KFLLLK 
FKISPSDPYRNNISQDFLAIGPTNMVPL 
QPYI VLCACRTKNNKI IDLDSEQSSRT 


10101 


24002 


A 


10176 


194 


412 


TLKNLLLSLHCLSFVPTDLYQPERLSKP 
PRPAKKYKFLI L / WLGWAHACNPSTLG 
GRGGWIT*AHEFDINLTK 


10102 


24003 


A 


10177 


100 


339 


VP VKS LLL C VC VC VLLL * R I S DHL P TL Y 
ANLG*CFFSYLVISYS\NSTYCILQRHL 
LAIQKQP/CSFLRQyHLVGWLHKKKHTH 
THTHTHTHTHKSNTHTHTHKS KDLTGTQ 
NTTEFLLK 


10103 


24004 


A 


10178 


229 


2 


KEKYI KSKLHFEARHNGSHL * SQHFGRP 
RLMDCL\VQDQPGKCGE3NTPYLQKNTKIS 
QAWWLVPWPVYSHRASKTLS 


10104 


24005 


A 


10179 


368 


2 


SGSSPSTSPQGRPDPLGLFSAKIVLGQL 
ATI PLPLCFLLKI FHPLTLP PLLLLGGE 
FNLSPQQQDSISLPWFLCYRYGPTYKAF 
FYV*ASVPEKl\FFFFFF*DTVSLCHPG 
WTTVAQSRLTA 


10105 


24006 


A 


10180 


398 


1 


QNIMLYTINTYNFICQLKKKTIQWGENF 
F FNKWC * ENW I FTCKR KNTLHP YL I P YTK 
TNS KW I S ELNL \ KSLKVNRGGNLHDFQ F 
GNGFLGDTKS KTK/ AKI S KLE FIKI KNF 
CVLND I I KKVKRQLIEWKKYLQ 


10106 


24007 


A 


10181 


252 


419 


GARFKGSNFTS AGGQG 1 1 FFMGPPKLI S 
RP VF *HRGEGKTPGVTQLNRLEEHPLFA 


10107 


24008 


A 


10182 


101 


380 


FCWIYNSWLIGYLFFPFRTLNILAMCGG 
S CL * S Q Y FGRQRQADHLS PGVRDQ P GQ Y 
getps/ltki S WAWWHVPWPATQEAEV 
GGSLEPREVK 


10108 


24009 


A 


10183 


2 


377 


PGQVDCFSSGVQDQPDQYGEILFLLKLQ 
KLGGHGG/IR*EDHLSLGGQGCKEPRLY 
HCTPAWVTEQYLVSQKKKKKKKK 


10109 


24010 


A 


10184 


140 


3 


DG VLLSLPRLE CNGAISAHRN \ SPS CDS 
PASASQAGCCGTCP*SQL 


10110 


24011 


A 


10185 


364 


3 


SVCPPRLGGVSQSGDMGVRDPFEEAV* P 
LAELERCAGRS TAF FRAI R* AHLSLLK\ 
FTRNHPFPHVPCPRVMSR* FGLLPFFQR 
CPAQRGE I WRRS LAAVALRS CGGRLQ VR 
TCWQLCLHR 


10111 


24012 


A 


10186 


2 


249 


S VRLNKNKENTQ I RNKKGDT INLTE I KR 
1 1 KEYS E Q L FANKIjH/ S LDEMNKFLE R * 
NPLRPGAEAQRLVS * RFWRATAI DHMRP 


10112 


24013 


A 


10187 


270 


2 


PWAQRGCVFKTNQKI FFGFFE I PHLD * P 
CVFQRPPS\F*VRLIFIGGGFKPFFFFF 
FEMKSCSVARLECSTISAHCNLRLPGSS 
DSPASA 
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nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 


M 
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SEQ ID 
NO: in 

USSN 

09/515,1 

26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, 'F=Phenylalanine, 
G=Glycine, H-Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 


10113 


24014 


A 


10188 


428 


28 


P I FDKG / E *GN* WGKKS I FNKWCWEKC I 
C/ KKLRDSSLTSY / TRINLKWI KDLNVR 
TK/STKLLEENIDIV/IYDLGLGIGFLD 
IVITLICARATK/AKIEKLDKIKIGNFYASK 
DPKKMKR*FIEFIEQGKIFTNHVSDDTS 
I 


10114 


24015 


A 


10189 


85 


243 


DRVS/LLSPRLECSGMISAHCSFDLPG* 
SDPFTSVPKSSWNYRCPPARKLSFT 


10115 


24016 


A 


10190 


96 


440 


LQSNHINTLNVNRLSSSMKRFRL.SDE I F 
\KKHGPALCCL * EMYYKDKGTCTLKI KG 
WKNIYHANANNNKT/GMVLLSGKIRLES 
N/TVIRVKEG*LVI/INGRLI / QEDLIL 
LFILNVYAP 


10116 


24017 


A 


10191 


137 


423 


AKD LDRQ FS KE D I HMTNR YMKT C / S GRQ 
LIRETQIKTTMRYPLTPLDLRKSKDKKS 
* * * YEQKGTPAHC* *ECKL VQPLQKI I * 
TFPKLLKINL 


10117 


24018 


A 


10192 


221 


72 


AKVK* VWFFCFLFFF*DGVS /HVWAQ 
ECSGAISAHCNLCLPGSSDSVRVG 


10118 


24019 


A 


10193 


460 


121 


SDKTKYW*RCRAI KTLTHF * LKSKLVPL 
L*KTI*CYLAKLKIHIVFDSAIPL*DTY 
FR/E* FCICAPKTYSNMVIEALFVPANQ 
TS KKKNWK* LS CPQ I VDNFLNI HLMKYH 
LE 


10119 


24020 


A 


10194 


393 


2 


ATMPS YQRLLLTAVHELES PE * MDQFLD 
TYNLPRFNQEE I ENLISTS P IMS WE IQSVI 
KCLPFGPNPGPNGFIIEFYHTYKEE/LP 
ILL*LFQKIEG\ILPNSFHEVSITLIRK 
TDKDAT * ETHKP I S LMNTDA 


10120 


24021 


A 


10195 


3 


353 


EAAPMPVKFFQLLEPSGPPGLSMSCLGD 
CNGYACFS VSLHQSANQS IDQ/S I YQSA 
IHLLIY/VSIIY*SI\YLLL*SLI*PMY 
VPIYPLS CL,S I YLS FMPLS CLS FNNLS I 
TRHGSMCL 


10121 


24022 


A 


10196 


118 


395 


DFNAGL L P S YQ VL S L YFC F * NG V \ CHC T 
P/RLGDRPRLR/ PPKKKKKKTPPPPLGG 
SPPGGEPYTPRPGVGPPPPPFEHPPREN 
PGCFHPPPKAWP 


10122 


24023 


A 


10197 


437 


160 


FGYPRVFPLPALLKRGPGILFWGPNKKN 
YSSPARGSKICFFKRGPPFFFFFYFFF* 
QTESCFVAPVGVQWCDHGSLQPQPPG\S 
SGPPPSTS 


10123 


24024 


A 


10198 


259 


3 


RQGNHVTHVYS SRIiQLILS PPASLAVF I 
QKIYVNLCTYIHIVWLYA*MCICVCICY 
TCIL\ ICI YGLVAVAHTYNPTTJaGAKLG 
GS 


10124 


24025 


A 


10199 


1 


416 


I I SAHCNLHFP S SWDYS RTP P YP ANFC I 
FSRDRVSPCWSGV*HHLEQHSKMLSKK* 
NKIDMHGGMCL * S * LLK/RAE * DVLRQE 
NCLNPGGRGCGEPGSRHCTPAWATRAKT 
LSKAS YAS S SS S PKKKKKKS PS 


10125 


24026 


A 


10200 


369 


3 


TAPPFFFFGRDGLRTSLIIFPSRArPYS 
LVTSGS FLPAGAGGRGLSLE I RVLDLVQ 
DGCPCSQAWAAALGGPGWWAPFLCFFET 
ASCSVAQEGMQ*CNLGSLQPPSPG\SSD 
SPASWLWTSK 


10126 


24027 


A 


10201 


314 


2 


LHHMRPEV* FTPSKLGVTGYMVSNISYP 
WRCRAHK* SQVLVSLRHKHLFPLEMESG 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=iVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














S EVQL \DTNRCI SANCNFCLPGS SNS PT 
SAS*EAGTTDSRQPDAWADAW 


10127 


24028 


A 


10202 


68 


413 


kigillkth*psic/vsitnf*yndtge 
lkvkg* kni yyaninqk/ kagmvmfvld 
etdfrakk/dyqgh* imievsvhke /e / 
iivi lnii yasnkrvtkymk * kd td i kge i 
dksttivgdl 


10128 


24029 


A 


10203 


351 


1 


PKFPPLNPAWGTTIKIFFKKKNAHNWFS 
P RKE KALQ K I HKKG / RKLAKE I S FLGG F 
* TDYKTNPDSQS I RTLGE KKKK I RPGT V 
AYAYNP S T FGG * GRW I T * GQE L KTS LAN 
MVGRV 


10129 


24030 


A 


10204 


2 


392 


FLEGVLLLLPRLECNGAIPAHHNL.HLPS 
PRFKRLSCLSLPSSWDHRHAPTRPANFV 
FLVETGFHHVGQDGLDLLTL*SPGLGLP 
KCWD YRH \ DHHAQ P S F 


10130 


24031 


A 


10205 


264 


2 


PLSLSNIiFPlPLNSILKIFFILK*L*IH 
GKLHKMYRKV/LV*TSPSPSQC*HLR*V 
* HN I QNKKLMGLGAVAHASNP S TLGGRG 
GWIT 


10131 


24032 


A 


10206 


281 


2 


ISPPLFFLFSPPKKIFFSQKHIFLGPPL 
FYPPPLFFSPPP\PHFLWGFRPVPNPMP 
KIDPQTWAKNKDKGPNFPPPFFFILPPK 
KNIFFTKTYFFGPPPFLPPPPFFFSPPK 
KK/SYPFPLKKNSPNPPPS*K*AAISLP 
PPFFYFFFFFFFSFFLENTWKMADAW 


10132 


24033 


A 


10207 


340 


1 


STKKLVQKLGEHTQCWQGCEATGTLIHC 
RWECKLVQPLWK/SAWQILRQL*NFMHL 
PYNPTIQLQGSYSRE\QHTCPYKDLYVA 
ALFTIAKNC*QAKFLSTGE*INKLWCIH 
ITE 


10133 


24034 


A 


10208 


136 


3 


KNRGPPPRSGKFFPFIaPLFFFFFF*DRV 
\YPSHPGWSAMVPSQLT 


10134 


24035 


A 


10209 


327 


2 


INKKYYEQLYA* KFDNLDE I YQFPGRYN 
L P KLTQE KANNLNRP VS LRE T KS T INNL 
PKRKVPDPDGFTGELYQTFKEE ITP\FS 
TEAKGILPNSFCEVTI IVRLFLLRQ 


10135 


24036 


A 


10210 


89 


485 


E I VPLLRFPLVI FIAS PTTFYTYGFVSG 
EKSQYKKREKLCDKTLEQSRSLLGMHST 
CGIALHPKREGQMCTLCACFY* FFLKES 
HS AARAGVQ WHNLS S LQLP P \ PS S SNS 


10136 


24037 


A 


10211 


361 


462 


IQQIGSNTGNIPAMELALKRNSKNINRL 
AGWNGP * VFAC/PKNVLFYLHP * LFG* I 
* IYQLKITF*NYEGMVP/FVFRFLFETR 
SLLSPRLECSGTITTHCSLELVGSINSS 
I S ARAE FLQRGANL 


10137 


24038 


A 


10212 


62 


469 


SLLWKLRSSGASTSLRCLSA\LLGDVSQ 
S GYTG VRD P LE E AVCLL S ELE RGAGRTT 
ALFRAVRQGRLSLEKL\LLPFVQICPAP 
GGGIYR/ GQ*ALLSCGGLCPV*AS* * /M 
CLPC/EA*TRLLKPPPGS 


10138 


24039 


A 


10213 


197 


1 


WTQPNEWCVTNKDAS FQQRCTKKI KT/ 
WPGMVAYTCNPSTLGGRDGWI T* GQQFK 
TSLANMVKPH 


10139 


24040 


A 


10214 


31 


213 


HASLDEGFHHVGQDGLNLDTS * STCLGL 
P KC WDYRHE P / P APNWWKKFLS S KAFKR 
*LGCY 


10140 


24041 


A 


10215 


442 


29 


P T I NP TMAC KC LS KRKS HTS F TLNQ KLE 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=0Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TIKLSEEGMLKAEAG*KLDHVH/QVSQI 
VNAKEKFLKE I KSATSLNI * MI RKQKRF 
IVNME/KKVLMFWTEDQTSHNIPLSQSL 
I RKR P * L FNS LKAE RS EEAP E KLE AS S G 
G 


10141 


24042 


A 


10216 


195 


533 


LTOFWLRVTYGVAAKMSSGTTI IGRLSW 
GWRLHF \ KMAHL * G YW WE AS VPLHMKLS 
S CHGTG \ S LRVKD P RE RLRQ KLQ CF YDL 
VLEVTYWHFYCILFIRSKSLSPAHIQSK 
RI 


10142 


24043 


A 


10217 


274 


494 


CVLKGLPFALPWFFFFF*DKVSPSPRLE 
CS VAI /SAHCSLNLLGSSDPPASASQLA 
GPFGLRR 


10143 


24044 


A 


10218 


142 


480 


VTIKIYLHPHDTTTPLRQGGSRLKEILV 
S Y* QQKVETRPSHVI FFTVI STQFKKTV 
TTRSGAVAHTCNPSTLGGREGQI/T/RG 
QEFETSMTNPVK 


10144 


24045 


A 


10219 


163 


464 


AVYFSFWRVCEHDLASLYFENSGKRDNK 
FVFLVLQAQGPAWRQGEVHKHLLPCVYH 
FAFWLTQQPQS FF * DG/QSHSIiAR/ LE 
CGGVI SAHCNLYLPGS SNS 


10145 


24046 


A 


10220 


379 


469 


V * KVQM / W PGM VAHT CHP S T LGG * GGW I 
T*DQEFETS3jADVAKPC 


10146 


24047 


A 


10221 


211 


473 


VTWLLLFFS I FRAS S LSTDS VS YVI AFQ 
FYFYLFIYFYLFIFETESCSVA*AGAQW 
CDLCSPQPPPRGS\SEEPSTLQMTSGSS 
WL 


10147 


24048 


A 


10222 


142 


394 


GVCQSLLGSLCQLGGTGVRDQLEEAVCP 
LAELEPCAWRTLLLRICCSLQSQQAGIF 
/RFC*SYIHICPFPQVLLSSEMGVLSIR 
P 


10148 


24049 


A 


10224 


77 


396 


KRKTKKS FWTPSGFGLRPP I PKKKKFFN 
S RGKQGG FGRG S GS RGGGGS T S GRGGG Y 
MGKVFGMGQ*NFTGDQGLAKGGFVFVFL 
GRTTIGIKGALKRLFA/NNEHDIiR' 


10149 


24050 


A 


10225 


155 


443 


GQAREGTLKSQFP FPS P PGWPL I LPGGV 
F S PKF F F F F FETRV/ CAVAQ AGVQ WCN I 
DS SNS CAS S S * I AR I TGAHHH \ TE L I F V 
F*ISPPSQ 


10150 


24051 


A 


10226 


290 


485 


KRGE G P VQ KKKF F FFFFETES CT VAQAG 
VQWHNLSSL*PPPPG\SSDSPAEFPR 


10151 


24052 


A 


10227 


228 


424 


KLYQNLHDTIiFGFEIiSHYEEVH* KMLKG 
S S WE FVMHS S / VSNKCILI CS AKKKKKK 
KKKKKKKKKKKKK 


10152 


24053 


A 


10228 


269 


454 


DLFAASQGHAQLLEELMATTLSLTHTHT 
HTHTHTHT P WS L F * S R / SAL S L VC 


10153 


24054 


A 


10229 


255 


1 


FYIFSKFLYVWLNVSHLDSHICFCIQSV 
I ISCFGVFYYYYY\FF*RQSCSWHTGV 
QWCDRS SMQPQLQGQNDPPTSASQVAGS 
A 


10154 


24055 


A 


10230 


18 


472 


GPEEFGTRRERDRERERETRDSRERVER 
VRERER* /RQRERERERERERER 


10155 


24056 


A 


10231 


52 


361 


WRKNYYFQIKITVYLENTKDFIQKLHTK 
CW*G*KEIGRAIHCWWECKM/VIQPVWK 
R/VWQFLRKLNMLLPHNPAVELLDIYPN 
ELKTHVCTKALYEAGRGGSRL 


10156 


24057 


A 


10232 


3 


402 


APRLPLGGWISFSSPMGAMGLNRATGIM 
GSFFFFFLKRKSPLGPQGEGRGQNLG*G 
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nucleotide deletion, \=possib!e 
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NPGPRG*THFPG*TP*KAGNNGEGPPPR 
GNFGFLK3CKGVPQKGQGGSKP/RDPRGT 
PRLGPPKGLE*RGGPPPPAPQK 


10157 


24058 


A 


10233 


48 


470 


PGRRFTTAGLGKQEPAASGPRAPRPAQT 
PLKREAARPSLRHLPEGA*APRSSARSP 
LLIPAASLRSPRRRTCRRRLRHFGIITR 
PFQPGVRLPDSGPGLA/RWACAAVPAAS 
RAL P ALRR PL* RR PGHGAP S S KS HGHS R 
SP 


10158 


24059 


A 


10234 


169 


446 


CPPGPAPSHLLEDRDDPQAGRQSSFYAG 
WWDRWREKTLFLFLRQSLTLSPTTPFFF 
\ FE AES HS VTHS GMPGVI S AH * NLRL PG 
SNNPPTSAS 


10159 


24060 


A 


10235 


264 


2 


KLKP I FKMVFLKI MFMS YIME VFFF I ET 
GSRSVTQAGVRWHDLDSLQP* PP \GS SD 
P P TS VSQVGKWHKT WE AE LA VSRDHATP 
RAKL 


10160 


24061 


A 


10236 


306 


3 


SQDMVKSPKKKFP* LV* GLTE I *KIRAL 
VFNKKFFFFFFFPETGSTGSCSVA\RLE 
CSGT 1 I AHCNLQLLGS SNP PASASRVIN 
S I KKLFANF I VSANPRPT 


10161 


24062 


A 


10237 


20 


469 


QCGAIPRRGDHLRSGV*DQPGQHGETPS 
LLKIQKLSRHDGH / CRRLSHKNGLNPGG 
RG CSELRSCHCT P AWATR \ GNS VS KKKK 
S 


10162 


24063 


A 


10238 


313 


3 


TRPRPTQGPPHSGGRACLCIFSPHGPPQ 
TRLSTPVSPSVRPLGFQVLLSARHGFLL 
CQHGMF\PSLFTMSNITLWPGAVAHTCN 
PNTLGGRGGW IT* GQE FQ T S 


10163 


24064 


A 


10239 


274 


49 


KGNPFFFVETGSHS I AQDGVQWCNHSSL 
QP*PP\G\SSDPPTSASRVAGTTGVCPC 
EDKLSHIIiRRCYAQVFMLAG 


10164 


24065 


A 


10240 


144 


454 


VCKIKIIYLIVLLEDYIN*FVKSSL*LK 
S * K/ L I QQMWAI I S LPE CLNS WCLGKV 
G* KRCS QQGT VAHTCNP STLGGRGGM I P 
R 


10165 


24066 


A 


10241 


289 


178 


FNYS S I KNNTDGWVWWCLLPVI PALWE V 
EAIESF/CSRDEIPLSCPGWSPVPKLKR 
FYCLNL P * CWDYRQEAP PHPAI SIX FDA 
TVIELFS 


10166 


24067 


A 


10242 


352 


294 


PPRTTKMW*DQPGQHVEAVTLQKMQKL 
VERGGKHPQSQLC/REDH*SMGGGGCSE 
PRP * P CTP AWVTERDHVS K I K* NKI K 


10167 


24068 


A 


10243 


293 


1 


VSLCRES SNS FIWQCPWSHI LALCFQR 
EKKRLDLP/S*MVGK*EIPFLVPLLFLP 
HWPLLLMK/ YM* I F F FGNRVS L CHPGWS 
AVAQSQLTAASVPRR 


10168 


24069 


A 


10244 


218 


3 


LERH I P VRP III CD YKWVWRF / HFLRPL 
HN*FLFSFFSFFFFFFETGSYSVT\RLE 
CSGTISAHCNLCVHIjLL 


10169 


24070 


A 


10245 


185 


1 


KKQPQQTKNPLNNVKKEGKGQHFFFFLE 
T*SRSVAQAGVQWCDLGSLQPPPPG\SS 
DPPPS 


10170 


24071 


A 


10246 


2 


265 


AHKKKPAKHIAPSTFIjYGETEARRSMPK 
AAQLKCGRAGI I LFVFKTES CSVAQDAV 
LWCNHGSLQP*PPG\SSDPADSACRVAR 
TTSI 


10171 


24072 


A 


10247 


2 


430 


EAGGREALDGCGLTGRESGVQRRRDSMG 
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CQGTAGRGTRAGKEGPWRPEHLSRRPPA 
SSVLLSWAIHLAWVEGS *TGSLCDIjEAP 
GPTLSPWCSWSPELLQLHV/HHHTHTHT 
HTHS/HPNTHTDTQSLCHQPSFSVSPRA 
RTVL 


10172 


24073 


A 


10248 


201 


16 


TPPGGVFFLGFFFLFFFFETESCCVAQA 
GVQ WCDLGS PHP PAS RRPGNRS S I /AAA 
*GYHFK 


10173 


24074 


A 


10249 


167 


2 


GFDPFEGMLSYLEKEET/WPGTRAHTCN 
PSSLEG*GGWIT*GREFKTSLTNMEKP 


10174 


24075 


A 


10250 


92 


389 


FQPGQHGEILSLYEIKKI* /ELAGCIDM 
CLWS Y / LLiRRLKQEDCS S T * VQGCS E P * 
CLSCIPSWVTEQDPDSGRQI*KKKKISL 
GRRGGIFLFLKIWHVFFL 


10175 


24076 


A 


10251 


221 


415 


KKKKRGGRFKGSNFTS PGWRGN I FFMGP 
P KLNS RAG V * QRRD / WE KP GG PQ FNC FG 
SNPPFPPGGK 


10176 


24077 


A 


10252 


94 


383 


GKLNRCG I WLLGRCQRS P AVSNALWLHR 
VAP E E E AGE AWAGT LS LQL S CKS K I T PN 
KIVY* KKNKT / WLGT VAHVCNP S TLGGR 
GGWII*GREFETSIj 


10177 


24078 


A 


10253 


249 


2 


YPTMIFKCSSKRKSPTSLTLNQKLEMMK 
LSEEGMSKAELS * KLGLLH/QVSQWNA 
KEKFLKEIKSATPANTGMIRKRNGLIA 


10178 


24079 


A 


10254 


132 


414 


NNTYFQGKIFILNSQVKGMLICCWWECK 
* VE PLWKAVWRFIiK \ DLE QNS KE YKL * T 
WNQPRYPAWDWI KKMW YI YFME YHA\ A 
IKKNEIMFFAA 


10179 


24080 


A 


10255 


359 


3 


YSLRHNIEMRPVSNPTRASKCSHERRSV 
I SLTLNGKRE I I Q E AS KEGTLKANKP KA 
KL F VP VRQ WNAKE KFL KE AKS AAP LN\ 
RMIRKQNSIjTADTEQV*VIWIEY*TSHS 
IPLSQSL 


10180 


24081 


A 


10256 


283 


2 


SLQPWKLYVHVFLPKYATYPPFGTLPKT 
HFFHKILPSKAGSDLNSIYAMTYSHRHI 
TIFFETES HS VTRLE CSGA I \ S AH * NLH 
LPGSSNSPATK 


10181 


24082 


A 


10257 


216 


386 


PSKNPPPLINPFFFFETDSCSFA*AGVQ 
WRNLSSLQTPPP\GSNNSCASAS/RIAG 

I 


10182 


24083 


A 


10258 


263 


1 


FSQWL ILPSPLSYS FLFLRQGFALLPRL 
ECSGE I TA\PSSAS * VAE I TGACHHAEF 
LNF F VEME FHHWQ AALKP LG S S E P P T P 
ATK 


10183 


24084 


A 


10259 


248 


47 


I PKS RVKRENRLNP GGKGCN * P RS P P C S 
P PWVAKQNSLS KK/NKVKI YQNLHTETQ 
TVHGD I RGQEK 


10184 


24085 


A 


10260 


159 


2 


KRDMSSLKLEY*KKE/RIWPGAVAHTYS 
LSTLGGREWIT*GQEFEISLTNMVK 


10185 


24086 


A 


10261 


190 


405 


ASLHMFMYFFFFFFPKQIFFFVPQAENQ 
WGNLGKRNPPPPGLRDF\PASPPKEAGF 
RGGPH* PGLiFLVF* GE 


10186 


24087 


A 


10262 


39 


293 


PSSSTSRIYPEDTPLITQKYICKILFIV 
ALF I FPKHWKLCKCPHIG / E * I KLQC I Y 
TWRY YVAVQKNEENL YQLMYND YQE ILV 
SD 


10187 


24088 


A 


10263 


282 


3 


AFYNRFFLKI FFKRGPRGKEKKGGKKEK 
KKWGKKKKKKF F F F S PGQ I F FLGGVFF F 
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FLFFFEAESRSVT*AGVQWCDQ\SSSQP 
RPPRLKPFFQ 


10188 


24089 


A 


10264 


196 


376 


KKGGGHI INFPPHKGYKPGPEKKKFFFF 
FFEMES CS VA\ RL*CSGMI / SQAHCNLC 
LPGSSN 


10189 


24090 


A 


10265 


514 


157 


GLTLS FRLE YSGMMSL* P * LPG/ FR* FS 
RLSLPSSWFPLMYVYHYAQQSFCIFFV 
ETGFHRWQAGLKVLGLRQSAHLGLPEC 
W \DYRCE PL*LTKLI CFTYGLR * RSKP P 
FFVDGIQLF 


10190 


24091 


A 


10266 


57 


378 


RSRIW*DSVILCS*LVSV/VDQSYFY*V 
TE ILEQ I KLLGS KI MCFC I LQLHFLCFL 
CGSFVGS T\ IFI FSRNI WLGTVAHACNP 
STLGGRGRWIT*DQEFETCLASMVK 


10191 


24092 


A 


10267 


1 


212 


LIQKDTNIYYIKEAIi*IPNKHI/KKISS 
I VSHYKMQ I KIMR / YHTRMAI IKRTDKC 
IiYECRETRTLMHCW*GC 


10192 


24093 


A 


10268 


3 


399 


MANKHIKIFSASL1RMEI * IRATMRE/H 
* F I P I KWLVLKS KRE SNKC WQGCGRTG I 
VIHFQRECKIVLLIi*KTV*QFLKKLNIR 
LGAVAHSCNPSTLGCLRPRVQDQPGQHG 
KTLSLLE I QKLAWHGLHLS PS 


10193 


24094 


A 


10269 


190 


386 


FSPFLPLVTILECDKAICRRGKSFDFFL 
LGPSFYFFKLLL/FIFYF*ERVLLCGPG 
WSTVAQSWLTA 


10194 


24095 


A 


10270 


166 


327 


KTNSKTAEVCPLSVITLNINGLNSPIKR 
QR*TDF*NPI\ICCLQETHFRFKGTN 


10195 


24096 


A 


10271 


180 


1 


IFFLFEVETCSVA*AGVQWNDLGSQQPP 
PP\GSSDFQLIFAFLVETGY* IMVPSSS 
CRT 


10196 


24097 


A 


10272 


273 


3 


FLGLFSGNPLKANKPVCHLTLPILQRIF 
FFFESEFHS\VAQAGVQWCDLGSIjKPPP 
PG\SSDFPGRRR*Q*AMITPLHSSRGDR 
VRLRLKKK 


10197 


24098 


A 


10273 


399 


27 


YHFVPTRMAI IRRKMN* CW * TCGE IGNL 
VYCWWECK/LV*LLRKIVWQFLK/DVKQ 
NYHPVILLLSIYRI *FDCLFLCKSHVEM 
*LISGWVGAWWEVIGSREWFLMSGFTI 
SSCHQDCEFSQDLIV 


10198 


24099 


A 


10274 


275 


462 


KINLVLQLDFFFVEMESHSVTQAGVQWC 
HLSSPQPPPP/GSSGDSRASDS*VAGIT 
SVHQHAGL 


10199 


24100 


A 


10275 


2 


379 


RGPGGRGLGSGTCFHPGRGGEP I VYQHL 
F* FFGP \ PENYIL ILPGFGI ISH/ IVTY 
YSGKKEPFGYIGMA*AMISIGFLGFIV* 
AHHIFTVGIDVDTRAYFTPPYHTFEEPG 
YI ISRRRRKTKGFDF 


10200 


24101 


A 


10276 


2 


407 


FPSPSPSPFASVSLSLSFHGLPLPEHTW 
FIDGSPTKPNRHSQAKAGYAIVPSXSII 
E ATA/ PAP FTTC * QAEL I ALTRALT I AK 
GLCINIYTD/SHIIiHHHAVMG*K/GFLT 
TQGSS I IDVSLIETLLKAFLLPKEAGA 


10201 


24102 


A 


10277 


203 


545 


F L WRGQS LALLLRLE CS GS ILL / AH CNL 
G'PHRFKRFSLPQP PE * LSNW\DYRRLPP 
CPANFCTFSRDGFTTLGQAGLQLLTSSD 
LPTLAS PKCW \ DYRHE PLVHGLTFNRS F 
NFFF 


10202 


24103 


A 


10278 


250 


3 


AKWFLEMKSTPGEDAANIIEM/TKDLDY 
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D I NLVDE AVAGFGR I DSNFERS CTWKM 
VSNSITCYRELFHERKSRLMCQM*LSYF 


10203 


24104 


A 


10279 


333 


3 


SLILLLFQS MNHTHVRVT F I DQ F C VCS N 
YSTDWLFPHLSPISLRTPYSLRYNIKIR 
PXNHPTMAMASKCSSERESRISLTLNQK 
\LETIKL/SEEGMSKAQIG*KLGI J LCH 


10204 


24105 


A 


10280 


251 


3 


ITSKVIDSNPTTS I ITLNLNGLNTSF* R 
QRLL/DWASSPK*DSIICYLQEMNFKYK 
I FKKIGRRI YYANI SQKKVGVAVLTPDK 
V 


10205 


24106 


A 


10281 


22 


226 


TKINSKGNKELNVRAKTIKLLEENIDRN 
LCDLGLGNCFLD*YCIL*TKTNSKGNKE 
LNVRAKT I KLLEENI DRNLCDLGLGNCF 
LDMT PNAQ T TKGR \ I DKLDF I KI * NFWL 
DT 


10206 


24107 


A 


10282 


238 


383 


DAKRKVSSTEGDSEGVAHERRSAHLSAK 
PCPAEVE/AKPQKAAEKG* SSE 


10207 


24108 


A 


10283 


127 


377 


RIPGGSALGKEIRQCFGVKGQAEIGKIE 
MSQP TQEREWEG I GS QKAAQVDHLRSG 
VQDQPGQYGETPTLLKIH*LAGHG/GRC 
L 


10208 


24109 


A 


10284 


328 


370 


VGPGPGF*SSGRVSSCCPGWS\PSELK* 
SAHISLPKCWDYRCQPLRPATFFFKTAS 
HYVAQAKVQWCNYS SLQPGTLRLKCCI F 
KKIQWQW 


10209 


24110 


A 


10285 


341 


1 


KWCVLTAPLSGRFS VCLP FLGP P YFLQ 
HSDIEIKPINYPTVASKCSNERRSHTSL 
TLN*KLETIKLIEEGGMSKAETGRKLSL 
L/QSVSQWNAKEKFLKEIRSITPVNT* 
MI 


10210 


24111 


A 


10286 


199 


329 


LFFSFLNCLiSPHRIEESY*KEIKSAPPI 
TPGIPRK*TSLIAGKKNVFMDWKDQTNH 
NIP/LSQSKTITLFMSMRAETVEEAEEK 
L/ERGCFMRFKERTHLHNIKMKGPGWA 
HTCNPNTLGGQGRQ I T 


10211 


24112 


A 


10287 


358 


387 


YTNALLL YV* DQPGHLMGKDS LFNKWC * 
KNE YSHAKKKQNH \ P * FSLYTKTNSKWS 
KDLNVRAKP I KI LE KK 


10212 


24113 


A 


10288 


111 


373 


SAQCQPDQHGKTPTLLRIQKLAGPDGAC 
L*SQLLR\ * GCGE PRS RHC T PAWAT/ EG 
KTPSQKKKKK 


10213 


24114 


A 


10289 


197 


375 


I R IGL CKF 1 1 * FL E T KLHT VAKVE C S S A 
I SAYGTLSLVDSGDS PAS AS /RWGTTG 


10214 


24115 


A 


10290 


118 


371 


Qd J LMMSPGLKT*MTGRLRQENRLMPGG 
GGCGE PRS CMCTPAWATK\GNS AS KNKQ 
T\KNKQQQQQN 


10215 


24116 


A 


10291 


214 


368 


CGYHQKNKKTNPKNSKCWQGCGE IGTLM 
HYWKDHKQV/ Q/PL*TIVQQFLKTRI 


10216 


24117 


A 


10292 


2 


376 


PFPLMAQRPTHLVPIiSASSGTSLLMPST 
LLWLPWLYKAHAVPTVPLSKLFDHTKPY 
AVSKNNLGAIiSI*SY*VIPVPPRSRF/S 
FIRDKPTS 


10217 


24118 


A 


10293 


1 


379 


NILHNLDFFKNTTLEPQSSRLQRAVIAP 
LHFSLGNIVRRCFKKKSGWAQWLTPVIS 
AL WEAD VAGS LEFKS ETTLGNMMRP CUNT 
FKKKKKKAFLFGAPGFFFFMGGKILFGI 
AAPF*SSGRVSSCCPGWS\PFELK*SGH 
I S LP KC * DYRCQPLRPATFFFKTASHYV 
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N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQAKVQWCNYSSLQPGTLRLKCCIFKKI 
QWQDI 


10218 


24119 


A 


10294 


201 


384 


GEERLSVQCSAFK*LR*HINISLQTFFF 
LRRESHSVA\RLECSGVISAHCNLCLPG 
SSDSPA 


10219 


24120 


A 


10295 


161 


400 


VC VI ES VKANHLVDEHALKSLS FHLSLL 
FGV*GVCVCVCVCVYMCF/CVCIYVC 


10220 


24121 


A 


10296 


1 


433 


GEGGGGCSGWSGVETAGSRPAPGAEQDL 
LPAPALGRWQGLNSVCYPAWPQPNRRQP 
IALLCAAGLGPGQRLECSGMIMAHHSLD 
LPGSGDPPMSAG\SIAGTTGTCHHAW\E 
FHHVAQGGLELLGPSDWPPTVLGLQM*A 
TTPGH 


10221 


24122 


A 


10297 


209 


1 


LTFELLYHHYTLYFF*ESGSCSVAQARM 
QWCDLGLLQPKALG\SGDPPASAS*DA* 
STGVDYYSWHTCI 


10222 


24123 


A 


10298 


225 


1 


RPGMPLSALPCTGHAPARQPHSAACSGP 
KCPWCQSPQPAVGWTV*VCPLPLSPCLC 
/FCVPQPWHGGLPLPCPPV 


10223 


24124 


A 


10299 


212 


367 


VTAGSGHMCYTWWECRLEQSL* KTVWRF 
LK* LKVELRFD PAVPLL / S I YPED 


10224 


24125 


A 


10300 


97 


406 


LHRGTPAWVTERGCLKTTNQNNGQH1WE 
GEACS SRPFTT I C / EEK I HLVCALTEED 
*RIAAETTANTTD I PIGSAYRI LI KKLK 
PSKLSTC/YVAKQLSPGQLQTK 


10225 


24126 


A 


10301 


3 


422 


YRS S S KTFL I IRRGFIPSSTFLLAHLST 
TLSTLSHFQKGWVKLFFFFFFKKGFPFW 
PPGGGGRAKP*F/HGNPPPPG*KNFPAL 
PRQGGGKKG\GPPPPGQFWFFKKKGVSP 
WGGGAKTPELGNPPPGPPKGGGITGGPP 
PP 


10226 


24127 


A 


10302 


330 


1421 


RTKGRPIiDLALPVIiLLLFIiHLAFFSAPP 
GPSSLSSTEDLKWTLLSPAQSRKLQPQS 
LHRAG VRDLQWQ GRAGAE LTAAC / W PAP 
PRPETAVLLQGLPC/QEGRLIiliESLNGF 
ALWSAEGTI FYASATIVDYLGFHQTDV 
MHQNIYDYIHVDDRQDFCRQLHWAMDPP 
QWFGQPPPLETGDDAILGRLLRAQEWG 
TGTPTE YS AFLTRCF I CRVRCliLDSTSG 
FLTMQFQGKLKFLFGQKKKAPSGAMLPP 
RLS LF C I AAP VLLP S AAEMKMR S ALLRA 
KPRADTAATAD / S KMRS * LTLS E SRLPN 
*FEIWDSAAPGTLPSLPFFGAFDGQHIF 
CIHTRALRGS TRTRQCIAAALEGASFRN 
HVEL 


10227 


24128 


A 


10303 


172 


383 


KIENKVWQGYEEIETLVYCWWECKMVQL 
LWKTTOTVLKILKLGPP* \DS I IPLLGI 
NQGTE IRFCKRYSNT 


10228 


24129 


A 


10304 


205 


2 


NFYPA*KTDFFLVGSLFIYFE* ASHPIT 
\ RLECSGTI VTHCSLDLLGS CNP PASAS 
HVS GTAGAHHHC I 


10229 


24130 


A 


10305 


423 


30 


LLEGKLTNRKDIRTKTPSVRRLHQSPKV 
DRTAKMGKKQ S RKS GNS KNQ S AYP PPKE 
HSSSPAMEQSWMENDELREEGFRQSS/M 
S KL KE E VRVNGK * VKTREKRLAKS LTR I 
TNAEKYLKDLMELKTSVLRPL 


10230 


24131 


A 


10306 


211 


387 


IFFFF* IKACSVAQAGGQGRHLGSLQPP 
PPGLKQDSC/ SPSRGVGTTGMTYHAQFI 
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G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P==Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFGF 


10231 


24132 


A 


10307 


229 


382 


ERAWQPG*HGET/PSLKKL*KLSRCGG 
MCLYSKLLRRLR*EDHLNPRSWGC 


10232 


24133 


A 


10308 


168 


417 


GSYCFFVRRNLLIVKYMRSVHLLFIMLL 
YRGD * W YVTTQ YNF C I LQKQ E KL» I YR I A 
/ H VRN P S S LRGGGG * IT*GREFKTSLAN 


10233 


24134 


A 


10309 


253 


402 


KGGGGG VKTPARGNP P P GP P KGGE KRGG 
PPPPTQKGFF/CFFFKKGTRPPGGKTGE 
RAKPNE I PAPGKKEFFLPKPPEKGE I KG 
/PRPPGQINF*FLKKTGFKKGGGGGSKP 
RPGGT P P PGPQRGGKNGGGP P P P P KKGF 
FGGKKKKKNP P R 


10234 


24135 


A 


10310 


182 


400 


LHVTM 1 1 FWLYL I KNMLiLNFTYF/ C YI F 
SVATSTFKIVCV/WLGAVAHICCPHTLG 
G * DGW I T * G QE FKTRLANM 


10235 


24136 


A 


10311 


371 


400 


FIWGNI * ASNKH I KKT \ STS LAFRKI Q V 
K\TTMRYLS PPTPIRMT/ 1 / 1 * KSYSSK 
CW*GCEEIRSLIHCKMIQPI*KT/WQFP 
K I INI QLLYNPAI ALLGNRTRG 


10236 


24137 


A 


10312 


113 


404 


QRCGKIGTFTHCWWEYKMVQPLWKrDWN 
LLQRLNIELPCDLAIPLLNLYPREMKTS 
IYICT*MFIAALFII/AKWRQP/KCSSA 
DE / I GKM WHI C I MK 


10237 


24138 


A 


10313 


146 


1 


FLTQKFFFFFETESRS *AWQWLDLGSL 
QPPSP\GSSDSPASAF*VAG 


10238 


24139 


A 


10314 


157 


434 


FFFSEIFFFGPGVFSPPPFFIGGPGIFF 
FCPLKKKILPPPRGKKFFFFKKGP /HPF 
FFFFFF*EGVSLLLPRLECNGAILAYCK 
IHLPGSSDLP 


10239 


24140 


A 


10315 


277 


2 


FFFTAFFLGVPGVFPPPRFFKPGPGFFF 
LGP * KKNFFPPPRGK/ 1 KFFFLKGP PS F 
FFFFFF* EGVSLLLPRLECNGAI LiAYCN 
LHLPGSSF 


10240 


24141 


A 


10316 


1 


388 


KTITHEVITECIWLKVLLLYFPNCMCIY 
VRMHICVCVCVHISI*MDHIVSYKCIHI 
\LCIY 


10241 


24142 


A 


10317 


134 


399 


LGLPGGDTRGGGTIENPPFERALNSQNG 
GK\NPLLGPKTRPLGWGLCSTPKFWEKP 
RVRWDLN* TTP CLE KRAPWLKAKKTPA 
FEIRI 


10242 


24143 


A 


10318 


233 


3 


EFLFSLICAICL*CALKQFSSCFFCLAL 
IELLGSMGLLFH/ SSLETFQPFFLQIFF 
/SFFPLFFGANYTYNKPFEYSLI 


10243 


24144 


A 


10319 ; 


221 


8 


CDQPKSPPLSGVQMGDILWLVFFLFLF 
AF*R*GSHCVAQAGLELPGSSDP/PQAT 
SQVAGTTGMHCTWLVF* F 


10244 


24145 


A 


10320 


1 


407 


RS TPDP VYLGFTSGGCKTEKRAGCFFLW 
KLHTKGAPARCQPELSGMRC\R*PLLGG 
V/SQSRYMAVKDSLEEAGCPLSELKRCA 
GRYTGLFKGGRLDHLNLLRLCPQTTLCP 
GAIiSQEDGSFIYKPLTGAAAFLSYM 


10245 


24146 


A 


10321 


115 


398 


ADEELLLMNKQIKYLLEMESTPG3CDDVN 
TVEMTKESEYSI/NLKAVAGFERIDSNF 
ERLSAVGKML*NSITCYKEIFH/RKSQL 
MSQTSLLSYFKLP 


10246 


24147 


A 


10322 


137 


454 


FFSLYFQGFFFFFFLGEKFFFSPGGKGI 
CPKPPFGPPPPPGKGFSPPPPPGKAKKK 
GPPPPPP\GFLFF*KKPGGGFLGGFFFF 
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N=Asparagine, P=Proline, 
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nucleotide deletion, V=possible 
nucleotide insertion 














LGGKIFFSPAGGKGGPQKPPFP 


10247 


24148 


A 


10323 


246 


372 


I S I KS YQIjW \ LGGVAYACNS S TLGGRGK 
WIT*GQKFKTSMANM 


10248 ' 


24149 


A 


10324 


415 


1 


I WGAPRVFP P P P F * NP P PE KNFWGP KKK 
KKFPPPPG*KIFFFKGPPPLFFFFFFFF 
SETESCSVT\RLGCSGGISAHCNLRLPA 
SSNSPPPSSLRMPSTPVLRPHGDLLSPQ 
LSAWVLSWPFLPSLLLPPDLDFIVQ 


10249 


24150 


A 


10325 


3 


559 


GDG I TGLSP T I S KFLF FVFES ISLSPRL 
GS S GAI MAHC S LTS RREPPCRS I NFFVS 
VITLNVNNLTNGKIGRMDFF*MM*YTIQ 
I CNLQNTHCRP / KDTNCFKMKRRKE I I H 
ANSNQKRAGVAILIPDKIDFKPKNVTER 
HRMLVRES /I SQEDVTI INTY 


10250 


24151 


A 


10326 


24 


387 


NQGLGARNQKS QRVFLAGF F F FES S S CS 
VAQAGVHW* DHS SV/ PG S S AP PAS AT W 
VGTTGVHNHPWLIF/VFFVETGSH 


10251 


24152 


A 


10327 


145 


396 


CDYFGEMFSYLLLNIYLCDDPAIPLLPY 
VP YENENTD S LKDMTDAHS S F I HNS P KL 
ETKCPGTDEWI *MTKKSDYCKHW*GYRV 
TGTLLYYEGECKI V* LLWGNV* LF\LLN 
IYLCDDPAIPLLPYVPYE\MKTRIHSKT 
*PMPIAALFIIAPNWK/PKCPGTDEWIN 
TL * HTHAVQ YHVAVE R 


10252 


24153 


A 


10328 


94 


639 


ASISSSI KWDNI S FLVGLLGELNE IRNI 
CKAVNMKQALNKWKLjILLFVIENNNVSA 
SSNNFKI FYAYY I AKAVLL 1 1 TEHITLS 
IVL1LKKQNNTRAVTKCW\RGGREECETL 
I WLVG * ECKLVQPLWRRVWRFL I KVKI E 
LPYDPT/ILLGIYSKERKSVYRK\NICT 
PMF\VAALFTTAKTWKH 


10253 


24154 


A 


10329 


143 


3 


S AWFFFQDRVLL * / S PRLECTVS ISNHC 
NLRVPGSSKNYCLSLISIWDR 


10254 


24155 


A 


10330 


153 


389 


RQGLTLFPGLVSNFWLPVLSSHLGLPKC 
WGYQCYHAWP S * CF * NRS I / WS GTVTHA 
YDPSTLGGQGKPIT*GQEFETSLA 


10255 


24156 


A 


10331 


382 


422 


RYKLDIi*KLNTELPYDPTVPLLGNYPKY 
*T*AF\SKTCT*MFIAAYSQPPKGRSKC 
S S PNE * INKMWP I Y I MR/ Y YS AI K 


10256 


24157 


A 


10332 


194 


1 


CKRQ CNKEDRNI / CS K* *NPT I LF* TTK 
RKG/WLVAVAHACNLSTLGG* GGRAS *A 
QEFKTSRANM 


10257 


24158 


A 


10333 


348 


1 


NSGQSWIMYPLLKKGVGRVSPPDPDIVN 
S ISMWCGGRLFSNERFASLLT I KKKLA 
GHGGTCLQSLPLGRLRREDRLSQGGQG/ 
C*GYSE*LCCRCIPAWAAEGDLWSWSPN 
SVAY 


10258 


24159 


A 


10334 


388 


2 


I PGS P KRGPGGE KKKT FFH YL F LRAAP P 
LPRFIFFPPKNPPLKGGGGFFFPPPPKL 
KTPRLR KKKKKENF CP WKKFFKFCKQTP Q 
RGR F F F F F FLRQ S FAVAQAG VQ W CD LGS 
LQPSPP \G* S YHPARTRG 


10259 


24160 


A 


10335 


3 


396 


LLRP P CS LRHND I E I MPHDNPMIAS KCS 
S KKKS C I SLTLS QKIiD VIKLS GKGMLKA 
KTGQNL/GLGHLYRTVRHAVNVKEKCLK 
EIKNAALLNTQMTRK*NSLIADTEKM*V 
V* IQE * AS / HSM P LRQNL I QNK 


10260 


24161 


A 


10336 


165 


442 


GQMELLIIARNFGILLHFNLSIYYRRLF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, 1= Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan T Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 














FSIPRYLFIFTYLFIYLFIYFESVAHSD 
AQVECSGTIMALT * KKNASLP /ASAPRA 
AGATG 


10261 


24162 


A 


10337 


33 


430 


TGS THAS ANF CVS L VWTQ L ISRSSGQGL 
ARLQLRCLPCCILFFSFFSLVIYRLHSY 
LRLPWRRICLQVPSGC*QNSFPCNFRIQ 
GSLLL*/MQQW 


10262 


24163 


A 


10338 


284 


3 


SQHFGRPRWVDHLRSGVRDQPGQHGETL, 
SRETP\SLLKIQKLSGHGSACL*SQLFG 
RLRQENCLNPRGGGCSEWRQHQCTPV*V 
TEQDSVPKKK 


10263 


24164 


A 


10339 


414 


2 


ERAGPFSRKIFFSPPLAKKVKYPFPPTP 
RARAHKRP I FS RYVKKE RGNS PDLFWP P 
FFPSLW*KQ*GPSLGRLFYPV/SWGIFP 
SKP*GFPGPRPKVFCFLG/L,PPGPFGGP 
FFFFFFGDRVSLCHSRWSAVAQSQLTAR 


10264 


24165 


A 


10340 . 


211 


422 


ASLGNGGRPSPSLAAALQFDL* LRAGI I 
FALFKAARQGDLS D * RLLLS F \ VCLCPA 
PRGGAYRGRQAFLSWGGLHPDRASRLFC 
LP KQAWAMAGAP P PALLLP CS L I S YCRA 
SNQQDSGGIGPSEPSAGYKLLVPGFLIiP 
AEK\RTIYGGVTRFFRGP 


10265 


24166 


A 


10341 


116 


432 


SRQRMPSTYHVQGSGDTKTGEKPCANGN 
KGCGS S ILRS CLHRKRFS F * S LKALVWC 
GGACLEAQLLRRLRQEAHLSQGVPGCNE 
L» * P G YCT PAW ATE P \ D S VS KKK 


10266 


24167 


A 


10342 


103 


458 


SLNVCVLTTLPS SCSPI S LPLRGPP YS L 
RHRNI E I RP IKNPTMAS KCS S KRKS * MS 
LT LNQ KLE \ M I NLHE E CML"KAKTG * NLG 
LLHQ I VD Q WNAKKKFL KK I KS AI P VNT 
QMVRKQN 


10267 


24168 


A 


10343 


53 


408 


S F Q F F CS VFS P S LW F YLLL VFDDGD VQM 
GFWCGYPFCLLVFLLTDKTLSCRSVGVP 
CRVRCQ CA\ LLGGASQLG \ SRGSGVRD P 
LEEAVCLLSDLQLRAGRTTTLF * AVRQG 
HLSLQRLL 


10268 


24169 


A 


10344 


265 


421 


NNFPPSPPRGGGNPGPPPPPRFFFFFLK 
ISTPGLP P F * VI YLL / YFTLTPS FCLWCYT 
I SVYGCGFVFFFFS FFFFFFEKKIFFFP 
RVEPKNRIPVYPKILFPG*NNFPPSPPR 
GGGNPGPPPPPRFFFFFLKNPGIi/HPF* 
KKGAKFPDLINPPPWP 


10269 


24170 


A 


10345 


207 


420 


WQKGCPVTFMELGWGPLYPCFLHLWIQ 
PTSHRKYIHL/WLGMWHTCNPNTLGGQ 
GGWIN*GQEIETSLTD 


10270 


24171 


A 


10346 


115 


3 


APRGAVKKR/RPWMGAATHVCKPSTLGG 
RGG* IT*GQEY 


10271 


24172 


A 


10347 


41 


421 „ 


FTFVSVCVHFKGIFLSFLSFFFETEVPG 
S PLGP KRGGGGGNS G P WNP P P PGLRG S P 
PPPPQRTGKGGPYPPPPGNFFFKFFFRK 
RGV/PPL*HPGGPQSVT*GTPPPGPPKF 
WGAQSGGPVWGKNPLF 


10272 


24173 


A 


10348 


228 


1 


S KQVKE K YD S F LH K I Q KQ AT * NRTFFRV 
TKKHVE ITKK/WAGHWAHAYNASTLGGQ 
GGWITLGQEFETSLANMVKP 


10273 


24174 


A 


10349 


405 


1 


GGFLKRGPPFFFFLRQGLSLLPRLECSG 
MTVAHCSPNILDSSNLSLRRSWDYSHQA 
QL I FL VS F* VLGSHS VAQAGVELDAS / P 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














KCWDYRHEPL/WPSQSQNLNSFFLDGFS 
LLLSRLECNAAISAHCNLRLLGSS 


10274 


24175 


A 


10350 


301 


546 


RHIFMNLPFHNLVKIIFHCRGKTLFLDI 
SNPYLKKQVSSSKPNCK/HLKEI*LFLE 
AKDYWLGWPHAYWPSTLGGQGRSITG 


10275 


24176 


A 


10351 


873 


1258 


FCYQCYFHLLHHLHHHRPSPRLEPPPPG 
AALPPVPEGRIIPLPCPSSGPRQTTGPG 
KGRRHPAPHRTPAASAASIiPARRTLPAA 
PW*PLSFQNTTPGSA\PGGGNQPASGAG 
GMSRT 


10276 


24177 


A 


10352 


184 


576 


GLSPLSHLKMAASSSEISEMKGVEESPK 
VPGEGPGHSEAETGPPQVLAGVPDQPEA 
PQPGPNTTAAPVDSGPKAGLAPETTETP 
AG\PQKQPRPQTSA*AQEGNQRPTAAPK 
THAQETVS KPE VS KE ATAD 


10277 


24178 


A 


10353 


110 


384 


VPGSPRKTRSPQQMERVLAPKTAKKGCP 
RPGDHRSRLQNCRDTRARIKWADSVERP 
P WAG \ P VAHACNPNTLGGRGGW I T * GQK 
FKTSLVNM 


10278 


24179 


A 


10354 


123 


422 


RKQIANDKSKLFI INKYLNVNELNSPIK 
/ RERLAGW I RKE KHD P S I CHL * KTHLT \ 
KDTNSLKMEG*KKIYHANNY/QKKAGVA 
ILISDKIDFKFKKKWREK 


10279 


24180 


A 


10355 


36 


426 


PPERPQYSARVCVCVFYTECRGLiAPKPE 
CRGAI I AHCLLNVLGSQ*SSNLRVWSS * 
DYTQL I FKLFVQTGSHC I AQAGFELL / S 
SKDPPTLA/ fpkhwdyrh\dhhsw 


10280 


24181 


A 


10356 


193 


3 


fqtrifkisfffkeflkfkkffffili* 
esrsiaqagvqwrdlgslqpppp\gsss 
ppasasly 


10281 


24182 


A 


10357 


205 


377 


spwfqpyyrltipyqkc*latvci*vy 

* FFKL ILYPAILLNS 1 1 IWINFTSDPL* 
FS SH I TG * LSL I KNVW VGPGMLAHA\ *N 
P S TLGG / RGG W IT*GQEFETS LAN M VKP 


10282 


24183 


A 


10358 


51 


376 


LCRPFQLCHGDRTDMWCPFPEELQDRCS 
CRSTSMRVAKGTTPCFLKVLQNIYTV*M 
F CRTFRKHG WPLATRML V \ I YKNSGPV 
TLQGRGTTYIiSCHHGKAGRVCRATQHAA 
GVWTKQRACI 


10283 


24184 


A 


10359 


295 


376 


ASELTSQSPGSLFENQKKGRCIAFLPC* 
KCNSEILVHCWWGCKM\WKIVW*FLKTL 
\RLELVYDPASPLLAIYPRCI 


10284 


24185 


A 


10360 


451 


1 


CQD I KT I LRNEQE VL VI PS VTK YKQVGK 
S FVSRRYLWSGCVLC YAFVTEAS KTWHH 
FPYRSPGFWGRSGGRQFDS*PKSQADFR 
APQGPPTPEPEGAERREPEIAPPPTADP 
S P KPT /HS P P RS GAP P SAP F P LLGRAER 
GVG\PGQGRPRP 


10285 


24186 


A 


10361 


2 


776 


FFFLRWSLALSPRLECSGAISAHCKLRL 
AGSRHS PAS AS RVAGTTGARYHARL I /L 
/ YLFLVETGFHHVSQDGLNLLTS * SACL 
GLPK\C\WIYRREPPRPAYFIFFLRRSL 
ALSPRLE C VQ WCDLRS L.QAVP PG \ LHHS 
PAS AS LVAGT TGAHHHTRLVL I LL \ FYY 
FLRWS VTLS CRLE \AGVQWHNLGS LQAP 
PPGFTPFSCLSLPSSWDYRCPPPCLANF 
LYF**RRGFTMLARMVSIS*PRDPPPSA 
SQSAHDLFIVGQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10286 


24187 


A 


10362 


188 


365 


CPQLHTQSNLINMDASQRYQFELKIQAL 
/WAGTVAHACNPSTLGSRGGWIT*GLEY 
ATSL 


10287 


24188 


A 


10363 


146 


1036 


GWAVYQTL QAAWLG P RT E S GVS FHGRFS 
LPHHPPQPTLHAWPAPPLQPEPCVPLRE 
DPFMPESW*LPPSIPAEGVSF*ECDFSR 
RMMGKGS PVPTVAP PLGPAPE P FCVTAG 
CAPMHTPTHTQ\PPHCSIFLPKISFKSK 
HFY*LLFCKCLSSSVFSLPESILLLFIV 
ESVCQPGERCLALNMGWDGKGSGRRWAT 
KSQGVMD I AATQTPA/ VFS P V / PATS CP 
AGVSGGK 


10288 


24189 


A 


10364 


229 


1 


PLENAKS I YWGKGRLLKNNARLFTVA* S 
WNQPQFSLMVDWIKKK\MWRMYTTESCA 
AI * R/ DE I VS CAGTWMELEAM Y 


10289 


24190 


A 


10365 


186 


449 


RFFFFFPKFFWVFGLKSPFFFFG/RLGI 
*EKDGGFFFFFFFFPEMASGSVA\RLEC 
SGAVSAHCNLQFPGSGSLPASAPRVA 


10290 


24191 


A 


10366 


445 


3 


RVDNGVRGRYRGKRGERGR* TRWGWGHP 
QRWDQDRSRERPRETGRERAAGRRRETE 
AAETGRRRRERGRS DRNRN / RGRPRERM 
I E AGRDAGRRE R * ARQE S W TE KKTARQ E 
EGDRD /RQGHQETRAHKGWGNN*D*GRQ 
AERSENRRMY 


10291 


24192 


A 


10367 


1 


491 


NTAPFPTS LESAF WVAVQDRVS / LLS P 
GPECRGAI \LTHCC/IjRTPG/ FKPISHLi 
S / LPS / SWDYRHVP P PLDNL I LCRDNVS 
QSGLKLLAS SNPLTS AFP ST/ YDYRRE T 
PGPRAS P * TQMQPCFMP /RS*SQP*TQI 
A/R * QSNWGS Q * VLNNRELAFGTMFFGQ 
RPKAFF ' 


10292 


24193 


A 


10368 


572 


40 


LHSFRQHFSNTYPQCTPGTP * VTQSLAA 
RPAGLPEPTSASGGAGAVTPPPRTD\PP 
SVRWSGKKGLDPVLRGGAPRPSPANPAP 
APQGPAPRKGTECVPTASKTPRPGGRWE 
PAAPPHAHPGKDLQPPSTLVKECPEARC 
PRALTGDLPSSGENRNGLSAFQGQRRPA 
QKSGENCEN 


10293 


24194 


A 


10370 


36 


517 


PFLFSDIDFLFSFYFLSFFFF*KRGLIL 
PP/GPEGQGGNKG*WPLHPRG*REPPNL 
TPQTRGNKGNRQTGPVIFGILRKNGFPP 
FGQGGFKPPGLREPPGLALQRGGNNGQN 
P YP /P / PGKFF*RTKRGEKS PQP P YNKP 
NKKFNPAPMEE FS S FKEGRGP S LN 


10294 


24195 


A 


10371 


60 


403 


DVEQPKLRSYPTLKCKPRSAILTICPWA 
LQYRSHGLI FGFSWGFLSFETES CSVSQ 
AGMQWCDHGSLQT/PTDSTSQAQAIDPP 
ASAP \ QAAGTTGACHHTCL I FVFL * RWG 
LAKP 


10295 


24196 


A 


10372 


358 


33 


DSLGVPGAPLLSPGVTPHKQWFFRPYTQ 
IKKLGFFPQIFFGFKV*KAPFFFWGFGN 
LKKKGGFFFFFFFSEMASGSVA\RLECS 
GAISAHCNLQFPGSS SLPTSAPRVA 


10296 


24197 


A 


10373 


264 


3 


RQTSVSGSNDGKSWPPADIDTI \KYVIG 
NRLITTQQ*LIKLTSKQMITIHNTKGRT 
*SLILVSIiI IFIATTNLLGLLPHSFTPT 
THCI 


10297 


24198 


A 


10374 


298 


384 


TL VI *KDGVHTTPHHQQHAYPGPKE YML 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenylaIanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y— Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 














HKAPLLHS VSRDPS PFAQS SNCYNRS I K 
QEPVDPLTQAEPVPRDAGKMGKTPLSEV 
SQNGGPS/HK*GQYSGDV 


10298 


24199 


A 


10375 


84 


518 


IMKLRQKSRSSLKIMRSKLQHTRAS/GT 
QLR/PMLGGNFIALN/AHIKKLGRTQIN 
NLTSQLKELEKQD/RNPKARGRQEIT\K 
I*AELKETET*KKEIQKIYEFRV/WFLE 
NINKP\PASRLIQRMIERNTIRNDKGNV 
TSAPPDV 


10299 


24200 


A 


10376 


70 


363 


FHQKNLVPGLWVQLLGRRTLQSQHPPAP 
PPSSSPEVNVPSEVQAAD*LCCIFYDVK 
VRKCSTPQKKTRKRKKGCLF/CAPGQTR 
/ CLC * KRGE KNLVGKT 


10300 


24201 


A 


10377 


299 


2 


YPVAGRGNR* GCGAHALGVGGAAMPLGA 
TISSVSLTISSLCAFPRRGGLEPSLPAP 
HTPPPPPE/P*ASSPSRGEWHLVPPCMQ 
PGLPPLPPPRLSPLV 


10301 


24202 


A 


10378 


302 


3 


WPTVATTCS CERKSHTSLT *NQKLEGTQ 
LS EEGMS KAE IGQKPGLCHQT / VQWNA 
KKKLLKE I KNAT P VNTQITRK\ QNS L I A 
DT\EKV*VIQIEDQTSHV 


10302 


24203 


A 


10379 


125 


375 


NEYVFLEKSLKLAGRVGSCLLGRPRQVD 
HLSSGAQD* PGQRGKTPSLQNIQKLSRH 
GGMCP/RRLRWEDHLS/SRGG*GCSETR 
ST 


10303 


24204 


A 


10380 


458 


3 


ITISTISCKATSFERAPRASRPPPDEWV 
SQAGPAPLRGWEN* GTRRRDDWPEASLK 
DPRRSGEPEAPL/PLRLML* SRLPPPLR 
WRRRPPPPDSHTPSPGNGKSPRVPRTGS 
KDKRPE ALPP * FAAGEPLEANVPPVQHG 
GGSVRSRGSYLTAT 


10304 


24205 


A 


10381 


327 


2 


LSAPKKFFFQKSLFLGANFRSPPFPGRP 
FTNRFSDLNVNPLTKKIYFSHKSSNPPT 
FFFFFFETGSNSVT*SGVEWRNHNSLRP 
ESPG\SGDRP*V*HTTGAPYHTWIiV 


10305 


24206 


A 


10382 


754 


1 


NTALETDTTFSGGGGGHQLCVRSRVSGG 
G*LIMPDLHNHPLQQTSGRNQRLQGEVT 
DPS PGMSGGAQTGPKAP PR P P S ALRHSG 
SIjQGQP * FAHLYCGT I P YPC YQTAPLPR 
APPPTPTGH*CPGEA\PQDELPVCGCSP 
WPRPPGQAQAGRNKH* FSGQRSSRDTTA 
CSPCSQSKQSPSRPPCVNDS/PGQGPGP 
KAPAGP P PALPEASLGS VHCS P * VGVMT 
PTRPGHFLVPGKVTLEQAPGGGASGGPV 
P 


10306 


24207 


A 


10383 


39 


433 


SHDQTYTEEELLFMDEQRKWFLLMESTP 
GEDAVTSTWE ITTKNLEYCINLVDSTVA/ 
GVSNFERRSTVGQMLSNSIVCYGKIF/R 
*KR/SQSMWQTSLLF\NKLSQPP*PSTP 
TILISQQPSMWRQDPPSEERL 


10307 


24208 


A 


10384 


283 


1 


NVFSCKPTS SCLIS PDS PTPDRRFS KAD 
T YRNFMF PLRRHTHMHTHT YMHTHVHVC 
TQIiCLHIiRPH/ ITHGLSL/TCAPPVL*R 
PSFKPLHPTPV 


10308 


24209 


A 


10385 


12 


471 


SPPWWTHGFRYWEPTPPRFSVPHDALGQ 
FHTALFRLTRRFQLLSS CINI S PHTCLP 
PCLGLCVGGPLEPAVCWAR/CPP*RGMV 
SGWKASPLG/CPPTLLPPC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
fast amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glu tarn ic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 


10309 


24210 


A 


10386 


287 


1 


S FCVRS RPAPTSGRT P FPAAP PFPAAP P 
FPAATTPSTAEDGVPG*G/PAPAADTCSP 
HSSQRGSAAMGIPLPQAS/PPAPTPAPP 
SP*SIiPFPPAPMY 


10310 


24211 


A 


10387 


336 < 


3 


L I E G KLTNRKD I RS KT AS VRHHHQS P KV 
DKPTKMGRNQSRKAENS \ KI KSASS P PK 
ECS S S PATEES WMENDELREEGFR* LVI 
TNFSELKEKTGKIiNPAALVCGRSEDQTT 


10311 


24212 


A 


10388 


15 


470 


APSPDAMGR\FTEEDKATITSDRCKVNV 
EDAGGETLWLQWYPWTQRYFDSFGNL 
S S AS A I MGN P KVKVHGKKE L TS L VDAI K 
HLDDLKGTFAQLSELHCDKLHVDPENFK 
LLWVLVTVLAIHFGKEFTPEVQAYLHK 
M VT WANS LR YK 


10312 


24213 


A 


10389 


3 


438 


QTQRE PTMVL S PADKTNVKAAWGKVGAH 
AGEYGAEALERMFLAFPTTKTYFPHFDL 
SHGSAQVKGHGKKVADALNNAVAHVDDM 
PNTLSDLNDLHAHKLLVDPVNFKLLNHC 
LLETLDGHLSAEFTP\AVNASLEQFTAF 
EITWLT 


10313 


24214 


A 


10390 


2 


444 


TMSFNTLAICLDCLCSTLQPTRSIPGYP 
SSPLPGNPTPPMTPSSSVPYMSPNQEVK 
SPFLPDLKPNLNSLHSSPSGSGPCDELR 
LTFP VRDGW/ S GALP PAAQ PGC 


10314 


24215 


A 


10391 


405 


3 


KMILTCLALPAQHCPPSACPQTMAQPPL 
PL S I KGAASMS D KL P YKVAD I GLAT WGH 
KALDIVEN/EMPGLMGMQELYSASRPLK 
GACIADCLQITVETAILIETLFSLGVQE 
QWSSCSIFSTQEHAVAVFAEAGMP 


10315 


24216 


A 


10392 


248 


5 


RLNKVGGHGGMYEHDYVWKLETLQEMYK 
FLD I YNLPNLS LE E IE \NLNRP I TSNKF 
ESVIKHFPTKKISGLDGFTAEFLTNN 


10316 


24217 


A 


10393 


262 


3 


LPRDRQAGRS QGP WPQAVGRARKMP P F 
I SYQVYS KNFFE I ESCS VTQAGVQWRN I 
SSMHPPPP\GSSDSPASASQAPGIPGAH 
HHA , 


10317 


24218 


A 


10394 


287 


410 


ALKGSSRILIMLGAWLTPVIPA/LWEAK 
AGRSSEVRS 


10318 


24219 


A 


10395 


175 


1 


GGREVDDEIPLVRESTIFHIFFFFFETE 
SHSVAQAGGQWRDIjGSLPSS\PPG\SCH 
SPA 


10319 


24220 


A 


10396 


661 


963 


STLIAFIVISTLFPLLDMTEIYFSLLDE 
IVDTLGEGAFGKWECIDHKAGGRHVAV 
KI VKNVDR YCEAAR S E IQ VL EHLNTTD P 
NSTFRCVQ\MLEWFEHH 


10320 


24221 


A 


10397 


1 


268 


PQTHREATMGLSTADWTNVKAGWGKGGA 
HAGEDGAEALERMFLSFPTTKTYFPHFD 
L S HGS AQDKGHGKKVADAL TN \ AVAHVD 
DMRNAL 


10321 


24222 


A 


10398 


33 


302 


SRGATL I YVDNENGE PGTRWAKDGLKL 
ESGPS IKALDGRSQVSTPRFGQTFDAPP 
AIiPKATRKASGTWRATEES V\ KTKGHL 
VQEHP 


10322 


24223 


A 


10399 


16 


233 


P TD YE FNTTT YRE CGPRRPE FS TS LDLL 
SQPCRAVYM\VGEKNDIPFELRIVDLIT 
GKSILGFGEPKSQEGDR 


10323 


24224 


A 


10400 


177 


1 


KLLDLGQPQSPALAACLEYSGAIAAM\C 
NIjCLLGSSNPPTSASRIARITGVHHAEP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, ENHistidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














YA 


10324 


24225 


A 


10401 


185 


419 


HVNKDTL E S HL VW KLKS AL LGRKS E S VT 
GPSCHPEPSDKTVEL/WQGAVAHACHLS 
TLGGQGGW I TLAQ E FET S LDN I V 


10325 


24226 


A 


10402 


104 


3 


GIILIP KPGRDTTKKENF / RPL I PI DAK 
ILNKILAN 


10326 


24227 


A 


10403 


3 


344 


SQVMAVAGPAPGAGARPRLDLQFLQRFL 
QILKVLFPSWSSQNALMFLTLLC/LTLL 
EQFGNYQVGLI PNQYYGVLGNKDLEGFK 
TLTFLAVMLIVLNSTVRSFPLSVSLVLS 
SPV 


10327 


24228 


A 


10404 


3 


404 


ARAQRRGDLSATGRNWSPLPPAGLPATV 
VLRHSGSLMAATCE ISNI FSNYFSAMYR 
S\EDYTLASVPPAATFGADDLVLTLSNP 
QTSLEGTEKANWMG\EQSQFWPKTQGLD 
WISFQAEKNKYEASAIGFPRWDK 


10328 


24229 


A 


10405 


254 


443 


NQLSSIMVMFKKIRSFEEDFNDPEKVYG 
SGDKVAG\RLIVEVCEDTRVKAVRILAC 
GVAKELRM 


10329 


24230 


A 


10406 


26 


461 


GEVARRS CCGAMAGTAL KRLMAE YKQLT 
LNPPEGIVAGPMNEENFFEWEALIMGPE 
DTCFE FGVFPAI LS FPLDYPLS P PKMRF 
TCEMFHPNT YPDGRVCI S I LHAPGD /DP 
HGLREQ \ PERWS P VQS VE KI LLSWSML 
AEPNDES 


10330 ' 


24231 


A 


10407 


1 


463 


QQAAKMAENSGRAGKSSG/ T/ PARGRGR - 
C/PAEQVIAGFNRLRQEQRGLAS KAAEL 
EME LNEHSL VI DTLKE VDETRKC YRMVG 
GVLVERTVKE VLPALENNKEQ IQKIIET 
LTQQLQAKGKELNEFREKHNIRLMGEDE 
KPAAKENS EGAGAKAS S P 


10331 


24232 


A 


10408 


180 


446 


KKAAAS VSAAS GSHLSNS FAE PS RSNGS 
MVRH S S S P YVR Y PSD KP FLNS DLRRS P S 
KPTLGYPESNSRAIFSAL/KNLQD/KIR 
RLNL 


10332 


24233 


A 


10409 


1 


417 


EAGLVTMEEIGIIiGEKAQDE I PALS VSR 
PQTGLSFLGPEPEDLEDLYSRF\KKLQQ 
ELEFLEVQEEYIKDEQKNLKKEFLHAQE 
EVKRIQS IPLVIGQFLEAVDQNTAIVGS 
TTGSNYYVRILSTIDRELLKPNASVALY 


10333 


24234 


A 


10410 


305 


53 


LEAQPHAPPLLGHHLPCWRIPSHLGTCR 
RPRAMRVQGSEEEGVA/GQAPPLPWVHR 
GSLP LVRRRPAGKP TGGNAREE KGKVEG 


10334 


24235 


A 


10411 


419 


2 


NNSRKFGNGTLKLVEVEIYPLI SES I PD 
KSELLLTPQDKKHAPLPSAVPNPSA/CP 
DLRPHTTPASPVS \ PAKTVETRP S APQG 
PLPGPVRRPRGRFS VRAAS PQTSGRPGA 
PRPPGPALiREATDAPRAATPP IAALAGH 
S 


10335 


24236 


A 


10412 


74 


322 


MDEMATTQISKDELDELIEAFAKVDLSS 
NGFI CDYELHELFMEANMPLPGYKARET 
I QKLMLDGDRNTDGKI S \ YDE FAYVSM 


10336 


24237 


A 


10415 


1 


415 


PQTHRE PTMWASADKTNVMAAWGKVGA 
HAGEYGAEALERMFLTFPTTKTYFPHLD 
LSHGSAQVKGHGKMVADALTNAVAHVDD 
MPNALYALSDLNAHKLWLDP / VNFMLLS 
HCLLVTLVAHLPDE FTPAVHACLDKLL 


10337 


24238 


A 


10416 


19 


454 


AP S PNAMGHCT E E D KA.T ITS L WGMVNVE 
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NO: of 
nucleotide 
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NO: in 
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Predicted 

beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W— Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DAGRETLGRLLWYPWTQRFFDSFGN\L 
S SDSAI MGNPKVKAHGKKVLTS LGYAI M 
HIjDDL KGT FAQL S E LiL CD KLHVD P ENF K 
LLGNVLVTVLAIHFGIEFTPEVQASWPK 
KMNAVA 


10338 


24239 


A 


10417 


2 


395 


AIjS S S Q P S APMHP F YTRAATM I GE I AAA 
VSFISKFLRTKGLTSERQLQTFSQSLQE 
LliAEHYKHHWFPEKPCKGSGYSMIGVNP 
\KMISLILQACIEGGI 


10339 


24240 


A 


10418 


2 


374 


LSTPHAFGTGKWGTFISLLFLFSSAYSR 
GVFRRDAHKSEAAHRIKDLGEEDFTALV 
VIAFAQHLQQWP FEDH\VQLANDVTEFA 
TTCAADESSENCDQSLHTLFGDIiLCSRA 
TLDESHCVQDGRP 


10340 


24241 


A 


10419 


48 


347 


DPKAQLPEPLRVLWQLTYAMAAGSR/TS 
LL/LAFALLCLPCFKAGPPNRPLSRLFD 
HAMLQAHRRTTAIDTYQEFENLYPKD/Q 
Q YS / FLMTPTS S ALDS I PTPS 


10341 


24242 


A 


10420 


166 


1 


NFLKKPGPFQKFFQRGLPLILTFFFFF\ 
FETKPHSVSL\LECGGTVSAHClSrLCLPV 


10342 


24243 


A 


10421 


2 


251 


LGCTQHRSQELVAAATSHQTCIQASEDV 
KE I / FARARNGKYRPLKI S I ENGQLM I G 
SY/SSQPSDSWDNDYDSFVLPLLEDKQL 
CY 


10343 


24244 


A 


10422 


251 


1271 


KEDLSPRAPMSGTQSTITDRFPLKKPIR 
HGSILNRESPTDKKQKVERIASHDFDPT 
DSSSKKTKSSSEESRSEIYGLV\QRCVI 
IQKDDNGFGLTVSGDNP\VFVQSVKEDG 
AAMRA\GVQTGDRI IKVNGTLVTHSKTHL 
EWKLIKSGSYVALTVQGRPPGSPQIPL 
ADSEVE P S VIGHMS P XMTS PHS PGASGN 
MERITSPVDMGEENNVVHNQKVEILRKM 
LQKEQERLQLLQEDYNRTPAQRLLKEIQ 
EAKKHI PQLQEQLSKATGSAQDGAWTP 
SRPLGDTLTVSEAETDPGDVLGRTDCSS 
GDASRPSSDNADSPKSGPKERIYLEENP 
EEQKKG 


10344 


24245 


A 


10423 


198 


569 


QRNMVGQRLVEPRRLKPGFINVKSYNGD 
WE WHQGFFLVCFFFLRWSLALS PRLECS 
GAISTHCNLCLPGFKQFSCLSLPSSWDY 
RHAHHCTQL I FVFLVEMG\ FHHVGQAGQ 
GFFSLEKSLiTI I 


10345 


24246 


A 


10424 


2 


343 


PQTQRE PTM VLS PADGTNVYAAWGKVGA 
HAGEYGAEALERMFLSFPTTKTYFPHFD 
L S HG S AQ VKGHGNKVS \ DAL TNAVAHVD 
DMPNALSALSDLHAHKLLVDPVNFKLLS 
HC 


10346 


24247 


A 


10425 


31 


342 


RAAVMPREDRATWKSNYFLKI IQLLDDY 
PKCFIVGADNEGSKQMQQIRMSLRGKAV 
VLMGKNTMMRKAI \RGHLENNPALEKLL 
PHIRGNVGFEITKEDLTEIR 


10347 


24248 


A 


10426 


196 


2 


KNLIISQTKKVICPNTLFLVFPKRVFFF 
FFFFETESRS / VS PRLECS A\ S I SAHYN 
LCLPGLSDSS 


10348 


24249 


A 


10427 


1 


405 


RLE CWLE P PHGAGLQGLG WVWS CS VS TG 
PTMQAL V\ LLLC I GALLGHMS CQNP AS P 
PEEGYPDPDSTRALEDDEDPIFIDPENK 
LPEAGSNFGYDLYRLRAIMS PTSNVLLP 
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C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
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Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














PLMVATDLWDLSLEAQQLTES I I 


10349 


24250 


A 


10428 


223 


361 


EGPTEENMAAKVFES/IGKFGLALAVAG 
/GPVNSALYNVDVGHRAVI FE 


10350 


24251 


A 


10430 


3 


359 


LTQRE PTMALS PGDMTNVKAAWGKVCAH 
AGDYGAEADERMFLYFPTT\ KTYFPHFD 
LSHGFAQVKGHGKKWDALTNAVAHVDD 
MPNALAALSDLHAHKLRVDPVNFKLLSH 
CLLVTLA 


10351 


24252 


A 


10431 


273 


1 


NHKDGKKQSGKTEKSKNQSASPPPKERS 
SSPATEQSWTENDFDEFREEGFI\RQSN 
YSELKEE IRTHGKEVKNLEKKLDDRLTR 
ITNAQK 


10352 


24253 


A 


10432 


211 


3 


S FLWKFCLRGI PSHVSCQSA/ LLLGGAS 
QLGYS GVRDPLEEAVCL FSDLKLHAGRT 
TTLFNAVRQGHLSLQ 


10353 


24254 


A 


10433 


281 


2 


TNQEKKKGDPKKIRSGKGDTTTDNTVTI 
IRDYYG/HTYTNKLKN/LEEVDKFLDTY 
YLPRFNQRE IENMNQPITSNE I ESVI KS 
LQTKKSPGPMAS 


10354 


24255 


A 


10434 


219 


1 


SFLWKLRLRGAPGCMRCQSA\LLGGVSH 
LGYSGVRDPLEEAVCPYSDLNLCAERTT 
TLFKAVREGCLSLQKF 


10355 


24256 


A 


10435 


133 


2 


FPKPPGNFFFFFETESHPVT\RLECSGA 
I S AHCNLHL PGS SDSPA 


10356 


24257 


A 


10436 


248 


376 


KGGVFFFFFFFFETESCSITQTGAQWHT 
\YGSLQPRPP 


10357 


24258 


A 


10437 


250 


1 


KGTRKE LT FI E QLL S AKY WAAGCMH YFR 
GSGKYSNGYPVFVFVFIFSETESCS\VA 
QAGGQWCDLVSLQPPPP\GSSDSPGASF 


10358 


24259 


A 


10438 


354 


2 


KRGLKKNPFFNPRITMGEFLKLWGPNPKK 
I FKNCKI FKMGLKKKVLGF YKNS KNLKR 
GKLNPFVIFFFFFFFFFFFLETGSYSIA 
\RLKCSNSGTNTAHCSPELPGSRDLPDS 
TTQVAG 


10359 


24260 


A 


10439 


126 


1 


GMYDCVCLFCFAIKEGREREKEREKERK 
RERQREKER/ERERDKERKREERK 


10360 


24261 


A 


10440 


29 


347 


IQKPTADTKKFFKKKLKYTTRENHLHNK 
EDRKEGKKEEKTTKQHGSSKSLLIMKTL 
NINGLKS P I KTHRVAE / WNKNKDPML YC 
IQETPFTYKDTHRL/RIQGWKKIFH 


10361 


24262 


A 


10442 


405 


1 


I FFPHP FFFKKS FRLSPQVE I QG I FLGP 
WKLFPPGFNNFS /CPNLPNNLGFKGLPP 
RPSYFFRIFCKKKGF/H/RVYPEGFPFL 
ALGFFPLFPFKSWGFQGDPPSWGPFFFF 
FFFVLVAEGTLSEGCRLIRGRWKNLjVP 


10362 


24263 


A 


10443 


122 


372 


LYNDP I S I YQVTLYRRVFFWVLYSANWS 
LRWKANFLF FS FFKMASDP VAQAGVQWC 
DLiDSLQPQPP\GL\SDTPTLASQVSETT 
G 


10363 


24264 


A 


10444 


171 


3 


RHPTMAGS S EMFLGSGFGCRLTVF VLFE 
TESCSVT/RECSGAISAHCYLSLPGSSN 
LP 


10364 


24265 


A 


10445 


173 


3 


CMNWCFSIFIFLKINFILETESHSVSQ 
AGVQW \ L I TVHCSLKLMGS SDP PS SAS Q 
V 


10365 


24266 


A 


10446 


187 


329 


SFLWKFCLRGVPGHVRCQSA\LLGVASQ 
LGYSRVRNPLEEAVCPFSDL 


10366 


24267 


A 


10447 


27 


226 


YS YPVYLLLH I FLCWKCSDYF I YFVFEM 
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ESCSFAQAGMKWCDLQPLPPG\SSDSLA 
PAS Q VAGVDAA 


10367 


24268 


A 


10448 


119 


296 


SPIHYILVIIICFIHVILLIIIFSEVGS 
HS VAPAGV\ SGAI I AHCNHKLLGSS S P P 

vsvs 


10368 


24269 


A 


10449 


111 


363 


SLKKLVFFFS I YTHNFFPFFDPTLKNST 
RHSLYLSLSLSLFLSHSV\SRLECGGTI 
IAHCSLNLPGSSDPHTSMSQV\AGPTGS 
C 


10369 


24270 


A 


10450 


218 


1 


QRFWGPGPNKVFKNLGSAFFLGRGKLFF 
LE I WGGFFFFFLVFFLRQSHS IAQAGVQ 
WCD\LNSLQPLPPRFK 


10370 


24271 


A 


10451 


179 


3 


GPFKLTLFFMGKFALGREKCHFFFFETE 
SCTVAPAGLQWRDLGSLQPPPP\GSSDS 
PAS 


10371 


24272 


A 


10452 


71 


232 


SSPLSDLLFNIVLEVLAKDIMQEVE/ IK 
LILI GKDEVKWS L IVDKI I L YVKNS KH 


10372 


24273 


A 


10453 


153 


3 


ARGELYFLE IGGGFFFFFLVFFLRQSHS 
IAQAGVQWCD\LNSLQPLPPRFK 


10373 


24274 


A 


10454 


338 


221 


MGFLHVGQASLKLLTS I CP \LGLPKCWD 
YRHEPPRPVIPS 


10374 


24275 


A 


10455 


214 


366 


YNYKYLIHYYSYLFFIFIFF/NIIHYIF 
FILFFYLTSFIFLSYFFFFFITLF 


10375 


24276 


A 


10456 


155 


2 


DRVSLLLPRLECNGAILAHCNLCLLGSR 
HS / P RVAG I T VAHHHARL I F C I F 


10376 


24277 


A 


10457 


870 


1249 


EGPRWADHLRPGVQDHLG\QHGETPSLi\ 
QKI QKLARRGGTC / LCQSQLLGRLSQEN 
CLNLGDGGCS E PRLHHC I PTWGDE \ GD S 
PKKKKE 


10377 


24278 


A 


10458 


222 


404 


KQRPGRAQWLTPVIPKLWEAELGR\MLE 
ARTLRPAWAT 


10378 


24279 


A 


10459 


122 


362 


LSEDQLNCYPQRLHHFIFLPVITRVQMS 
HYIiCQQSCLFHFF/ SFFFLYFEMESSS V 
T\RLECTGMISAHSNLYLPGSSQSSG 


10379 


24280 


A 


10460 


3 


102 


AASTLALSPRLECNSAILAHCKLH/LPA 
FTPFSCL 


10380 


24281 


A 


10461 


133 


339 


KLSKFQGSKKPEIKANSLDIALFYHNYI 
TRQCFTS YSNFFGDRVSLBS PRLECNGV 
ILAHCNL/RTPGFK 


10381 


24282 


A 


10462 


3 


193 


ASNMMTELKSQIP ILTLNVNGLNAPLKR 
HRVAGW INDPT I C YLQET YFT / CKNNHR 
LTAK 


10382 


24283 


A 


10463 


126 


314 


RLGGS T I KVQRD L S S DRS KP GR FL YT S N 
SS YVRKGKRNKACFTERVP / PVNDI I 


10383 


24284 


A 


10464 


1 


332 


kknlflskdclsslfrfsksppailgpp 
rgwtpktrfigpkafffkkgslslgrky 
hfgl kp vp / cp p fl s ktr fl pg vp ffff 
fffeteshsva\rlecsg/aisahcklr 

L 


10384 


24285 


A 


10465 


173 


365 


rfvcstikvlrdlssdrsnpgrflslqi 

PP \LRKDKRNKAYFTK/RPS PVNDX1ST 


10385 


24286 


A 


10466 


335 


3 


pfkkfyfkistrallepvffgkkirvpp 
qrlspiwaplfkrgppnsfqvngvplff 
ffffvrhglalssgvecsgtrlecsgai 
/ stahchlchldashpptsaswvarptg 

T 


10386 


24287 


A 


10467 


818 


1008 


VHTDVLAD I LKS INNAEKRQT/ RRCFLS 
\WLQGHCPFLTVMMQHGLCGVELGGLPG 
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K=Lysine, L=Leucine, M=Methionine, 
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nucleotide deletion, \=possib!e 
nucleotide insertion 














PGDFCKRPR 


10387 


24288 


A 


10468 


111 


1 


MVSFGRPRQADHKV\GVRDQPGQHGETP 
SLLKNIRI 


10388 


24289 


A 


10469 


204 


1 


LWGPILKIFCQILKTICEGNFRGPTFFI 
FFFEMESQSVTQARMQWCDLSSLQPLPP 
GS\SNSRASAS 


10389 


24290 


A 


10470 


190 


330 


ERI KKQDLS I CCLQVTHFTFKDSQRLKV 
KGWKK\ I FHTNKNQKR I WT 


10390 


24291 


A 


10471 


162 


2 


IKGPKRAPPAJSTKGRAFFFFFFFFETESC 
SCLQAAVQWHDLSSPQPPTP\GSSDS 


10391 


24292 


A 


10472 


120 


3 


SWGF F FFFF \ EVE S YS VAQ VG VQWCDLG 
SLQAPHVMILVK 


10392 


24293 


A 


10473 


298 


1 


RRAL PCPANFLYF YFLS LLLFFETE FHS 
\VAQAGVRWHELGSLQTPPPG\SSDSPA 
SGEYSANYTGEYSANYTGEYSANYTGEY 
SVNYTGEYSVNYTGEY 


10393 


24294 


A 


10474 


1 


380 


AVTGRAGSMVAPR\PLRRLVLFYQGKLR 
SMAGNFWQS SHYLQRILDKHDLLNERQK 
DLKVLSEEEYWKLQRFFTKVIQALCEHL 
KLRQ P G I ATATAY F KRF YARHS LKS IDP 
VLMAPTCVCLAS TAN 


10394 


24295 


A 


10475 


289 


2 


TQGFPPINNS FYALYPKQKIPFFSDKLF 
RVKGFKGPVFHYGVKDFKNKDGVFF I FF 
FEMASGSVAQAGVQWRNLGSLQPPI \PG 
SSSLTTSAPRVAS 


10395 


24296 


A 


10476 


138 


402 


ITGRAPPKP FPLKRGYLSGGLTNIGFS F 
SFSFFETESRSVTQAGVQWRNLGSRSWL 
EST/SPDSSNS/PAPASRVAAIT 


10396 


24297 


A 


10477 


2 


336 


ARETNDTTQTRLLPANNTMTRSDPYI S I 
LTLNVNGLNTP I KR\KRVAS WINKQGLM 
GCCLQETHLSSHETHNDTHKLKIKTWRK 
I YQANGKQKKARW/ 1 L I SDKTDFKP 


10397 


24298 


A 


10478 


51 


250 


RKVSLFKKWCWNTLFICKK\LSLDTYRF 
AKINFEWITDL3WREKT/IKLLKGNIKE 
NICDLGLGKDFF 


10398 


24299 


A 


10479 


216 


2 


GFTRKALSGFFKTGWFWVFPNFSKLFFK 
KNFSERFFFFFFFEMESHSVT\RLECSG 
PISAHCNLHLPGSSLV 


10399 


24300 


A 


10480 


191 


3 


FSTDGNVNCLVKYS I Y I YI YFFFFL IFF 
FEMKSCS/VSPRLECNGALSAHCKLCLP 
GS CHS PARA 


10400 


24301 


A 


10481 


135 


768 


LGGQVLSCASQPVSLFPLFFFLISFLGC 
LLAFLS ACFASTFS CVLGS PETS FSPCL 
HPPPKVLSSHP\PPCSRP/CPQPPKG/P 
PLPKHACPP 


10401 


24302 


A 


10482 


126 


333 


MWGQRYCVMCFVI \VLCLFLLV/ WWW 
FLW/ CS VY IVY WYG YAWL F I FFFFFFS 
LLCFAVCLFVWLLDRF 


10402 


24303 


A 


10483 


240 


2 


ATEKNMGLGTSVRGLSGSNPSYGRVRER 
I GGGVCIiWSQ I L LRGC \ LRWEDHLGLGG 
GGCSEPRSSHCTPTWVTEQDPVLV 


10403 


24304 


A 


10484 


136 


2 


NRPS YI FS FFFETE SHSVAQAGVLWHDL 
SSLQPLPP\GSSDSPARA 


10404 


24305 


A 


10485 


161 


328 


EKNFKPYGSCRHFYLFIYFETESRSVA\ 
RLE CSGT I S AHCNLRLPGS SN/ S PAS 


10405 


24306 


A 


10486 


193 


347 


ILFSFLLFIYFFETES CS VAQAGVQWHD 
HGSLRSRAPG\SYNAPALDSQGTE 


10406 


24307 


A 


10487 


114 


2 


GAWQS E VQWRDSGS LQAP P PG \ SRHS P 



1183 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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ASASQVAGLV 


10407 


24308 


A 


10488 . 


314 


49 


IRRSTERSEPRYSMVCRRQPRQGGPCL\ 
RSQLLGRLRQEDHLSPIAGDCSEPRLH\ 
LACTPAWATEQEPVLKKMS PTLFCLFLW 
PLPCQ 


10408 


24309 


A 


10489 


291 


1 


KS PP P PNRRGLTLR SRGQ I GNGVS FY YE 
APVTDFIFFFQSLVLSTQAGLQWCNLSS 
LQLP P PG\ S SDS P AS ASRVAP VS QGHAT 
APQPGRQTLSSC 


10409 


24310 


A 


10490 


226 


3 


GSRCTMPRRRIAHAFPAARMPKRKVSST 
EWATEEE PKRRS ARLS AEPAPAKVETK/ 
PE KAAGKDKS SDKKVQTKLV 


10410 


24311 


A 


10491 


2 


153 


ARGLVFFVYLVEMTLCHVAQAAVKPLTS 
SEPP\LGLPKCWDYRREPPRPT 


10411 


24312 


A 


10492 


123 


342 


THYLCLLRFFFLDRGFLLLPRLDC/NAP 
I S PNLNLRLPAS SDS PASAS RVTGITW 
NHYARLTLLFIFQNETbF 


10412 


24313 


A 


10493 


194 


2 


IPGWPFKNFSLFPLFFFFFEMESCSVAH 
AGV/GS S DF PASAS QVAGTTSMR IMAWL 
IFVFFSRN 


10413 


24314 


A 


10494 


373 


1156 


KVQLKVYLFFFLRLSLALLPRLECSGTI 
LAHCNLP PSRFEQFS CLSLPS SWNYRRP 
PPHHARL I FVL/ S VETGFHHVGQAGLEL 
LTSGDPPASA/FPKCWDYRHE 


10414 


24315 


A 


10495 


181 


3 


S FLWKFS LRGVPGRVRCQS A\ LLGGAS Q 
LGYSGLRDPLQEAVCPFSDLKLCAGRTT 
TLFK 


10415 


24316 


A 


10496 


156 


2 


AASTTRLGDRDQPGQHSETPSLLKIQKL 
G\SGGGHLRLRQENCLNP\GGRGC 


10416 


24317 


A 


10497 


157 


300 


I QSWFNI LKS IHVI HHSNRLK/ EVNH IV 
STDAEKAADKIQHPFLIKKQ 


10417 


24318 


A 


10498 


3 


289 


L VKKML TG S TLG KS YRHS P F S I NQGHNA 
LRKAAG\ PL PRKAGY\liQGFS PLRYGL \ 
WD GKD L T I HQ PDTREGS VL S R I S KRGR P 
L/CSCLPLGTECLSI 


10418 


24319 


A 


10499 


138 


1 


W YGL I YLLIiEME S C YVAQAGVQWHDLP S 
LQPPPP\NSSNSPASSG 


10419 


24320 


A 


10500 


138 


1 


WEGLIYLLLEMESCSVAQAGVQWHDLGS 
LQPPPP\NSSNSPASSC 


10420 


24321 


A 


10501 


200 


343 


LEAHS FLSFFIFIII FFDTKSHS VT \RL 
ECSAAI S AHCSLLL PGS SNS 


10421 


24322 


A 


10502 


84 


339 


PRDGKRRDKQKQETGGEHYKS QLRGRLR 
QEDHVSPGGRGCSEPRSRHCTPAWAT\E 
SNSVSKKS 


10422 


24323 


A 


10503 


141 


362 


E WNCCVCC / CCCCCCRCF 


10423 


24324 


A 


10504 


198 


320 


TIKKQKNTGGLIFNSYILPPLFIiEPGDL 
\RFLDVDNRVKLPV 


10424 


24325 


A 


10505 


30 


356 


GYPCNPILYYMSTVGFTTSLLDFDIPLE 
HP ILKVCPTHTRALRDLSSDRSNPGRIL 
STSNSSLY/EKDPRNKAYFTK/RPSPVN 
DUST 


10425 


24326 


A 


10506 


16 


375 


KKFFKKKKKKIFFPPFPLKNFFFPKRVN 
FFGRGWPKKS PPKKKVFFKKFPGGFKKP 
PLKKKKI FFS PPVKMGPPKGFFKKGPPP 
FFFFFFFFFFFFFFFFFFFFL/LCLIPR 


10426 


24327 


A 


10507 


156 


2 


AASTTRLGDRDQPGQHSETPSLLKIQKL 
GXSGGGPLRIjRQENRLNPXGGRGC 


10427 


24328 


A 


10508 


313 


2 


QSKWLLEEESTLGEDAVNIVEMTTKDLE 
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X=Un known, *— Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














YYRNLVDKAAAGFERIDSNFESSSTVAK 

I LSNC ivc/y/rei lhegksrpmwqtsl 

SYFKELPQTTQHSAIITLTSPRA 


10428 


24329 


A 


10509 


325 


1 


E CAKE MNAE T KRN I AI DAT E TQR 1 1 RD C 
YEHYNKMENLEAMDKFLDTHNLPRLNHE 
EIENLNEPIMSNETETV/l/NSLPSKKN 
SGSEGFTAEFYQRYKTPLILKLFKK 


10429 


24330 


A 


10510 


126 


14 


DSSGQVQWLKP I I PV/ LWEPEAGRSPEV 
RSSRPACPTW 


10430 


24331 


A 


10511 


435 


769 


P FKFP PLSHACDGGS LFCRLRLE CSGV I 
T \ AHCKPQLPG/ S OS DFPAS TS QELGP / 
TGACHYTRLICCREEVL\HVAQAGLELL 
CPKDPPTSASQHARDYRH\GHPAQL 


10431 


24332 


A 


10512 


147 


412 


RFVCST I KVLRDLS SDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNE 


10432 


24333 


A 


10513 


251 


466 


QTCPTGSPASLLELFNSIATQWELVRSL 
I AGNALKDEND \ SAVKMLAS L I MSYKAA 
AAEDYKADCPPRNPGP 


10433 


24334 


A 


10514 


286 


1 


SFLRKFCLRGVPSRVKCQSA\LLGGASQ 
LGYSGVRYPLEEAVCPFSHLHLHAGRTT 
TLFKAVRQEHLSLQRILLPLVGRVGRPN 
DSRCDRVIGH 


10434 


24335 


A 


10515 


28 


428 


RFVCST I KVLRDLS SDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10435 


24336 


A 


10516 


263 


2 


I KKPADSE SAGAL I LDFPAS QTVRNS FL 
LLTSHSVCYFCYSS PNGLRQQLKI I IFF 
FFLRQSCSV\SRLECSGMILAHCNLCLP 
GSNN 


10436 


24337 


A 


10517 


62 


292 


PATL F S GV I LP P TNS I NAF S GS D S KS AH 
K/ CEMLGTENFNFFTCHTHI I AEHSNS T 
HTHTHTHTQTHTHTHTPLLFYE 


10437 


24338 


A 


10518 


172 


2 


RGKKIFFPLEKGKNLWWITALFFFFFE 
TDSUSV\ TRliKCSGTXhYUCSLCLPGSS 
DF 


10438 


24339 


A 


10519 


42 


194 


ILVETF\CLRFVSLENLVKMRFLKLKKK 
KKKKKKKKKKKKKKKKKRKKKKI 


10439 


24340 


A 


10520 


120 


1 


I FAKRWPI KFFPFFFFFFETVFHS \ VAQ 
AGVQWCDLGSP 


10440 


24341 


A 


10521 


202 


375 


SFLWKLHLSGVPGHVRCQSALTGG\ASQ 
LSYSGVTDTLEEAVCPFSDLKLRAGRTT 
TL 


10441 


24342 


A 


10522 


272 


402 


PKIFYVTKKAWNYYPYTITEYTCSF/IR 
KFFIHI ETKYEDNKGSN 


10442 


24343 


A 


10523 


147 


439 


L I L Y VN Y I S RKS TV YD S RHP CT Y YL I P E 
KNCYKYHCNFLCT/SSSIPFPTPSTPSR 
AS PKKKKKKKKKKKKKKKKKKKKARGG 


10443 


24344 


A 


10524 


7 


415 


RFVCSTIKVLRDLS SDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10444 


24345 


A 


10525 


321 


442 


TSFSFLFFF/LETE FHS VA\RLE CRGT I 
LVHCNLCLPGSSNS 


10445 


24346 


A 


10526 


477 


3 


IEGKLPNRKDIHTKNPSVCRHHQRPKVD 
KT TKMGKKQ S R KTGNS KNQS AS P P P KE R 
SSSPAMEQSWTEKDFDKLREEGFRGSIY 
SKL/REEIRTNGKEVKNFEKELDEWITR 
I TNAKKSLKDLMELKAEAQELRDECTSR 
AASANCVAVPQAFDRSHGVQ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C-Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R= Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *~Stop codon, /=possib!e 
nucleotide deletion, V=possibIe 
nucleotide insertion 


10446 


24347 


A 


10527 


265 


411 


RFVCSTTKVLRDLSSDRSNPGRCLSTSN 
S/SPVRKDKRNKAYPTK/RPSPRN 


10447 


24348 


A 


10528 


81 


390 


RFVCSTIKVLRDLSSDRSNPGRVLSTSN 
SSLY / EKDKRNKAYFTK/ RP S PVND 1 1 


10448 


24349 


A 


10529 


253 


23 


RI FLFYFHS FLRQGLARAEGQ PHNHS SL 
QPPPPG\SGDPPASASRAARTTSAYHHI 
PLFFFRSYCVAQAKIINMLSIiL 


10449 


24350 


A 


10530 


10 


398 


RFVCSTI KVLLDLS SNRSNPGRFLS TSN 
SSLY /EKDKRNKAYFTK/ RPSP 


10450 


24351 


A 


10531 


289 


423 


RFVCSTSKVLRDLSSDRSSPGRFLSTSN 

ssl/qrkdkrnkayfter 


10451 


24352 


A 


10532 


152 


401 


PHCS ihsqvriqanvhkghrqrtygsvi 
philplhvlkktfslrdfhfsvslkknl 
vltclhlfl/gvrtprndpfvsmmllft 


10452 


24353 


A 


10533 


2 


515 


LDRISLLLPRLKCSGPVSAHRNLYLP/G 
FKRFCSLGLPSSWDYRRASPRVAKFVFL 
VDTGFHHVGQAGLKLLTSSDPPSLTY/P 
QCWDYRREPHLAAF 


10453 


24354 


A 


10534 


469 


566 


PTKNTKI SREWWCVPV/ 1 WE AE ARE S LE 
PGKQRL 


10454 


24355 


A 


10535 


354 


47 


KPKNLPGGGGPPFYSSFSKGKKIPLPWK 
AKVPINQILPLPPHPGGGKKTPFPKKKK 
KKE RKKKKMAT VQ KGM / PHKYYHGKTGS 
YNVI QHAVGNAVTNRTRGS 


10455 


24356 


A 


10536 


1 


409 


TPKKRFFP KKP KG F FF P P P KKKKKI F S P 
PPKFWPPQKNFKKAPPPFFFFFFFFFFF 
FFFFLKSNPFFFFF/SKNPILKPPLPPL 
FRVFP P P KKKKKKDP 


10456 


24357 


A 


10537 


204 


435 


TS GS L Y KLLGMGD LGLGKT T I FKQGGHQ 
NS S \ GWH PE AT TG / VNWAF KGLHWD P KA 
LAHLQLWNIAL/ QER/FGKMTKAGY \ 


10457 


24358 


A 


10538 


208 


382 


RFVCSTIKVLRDLSSDRSNPGRFLFTSN 
S SLFER/DQRNKAFFPK/ RP S PVND IIS 
T 


10458 


24359 


A 


10539 


63 


409 


GWRLTNFPVGKTPSTSGTGLLFIFLLLN 
LMGRLKPSS I SMKREFRKQNRGQKC / E F 
MKKKKKKKKKKKKKKKKKKKKKIGGGAF 
LKIFRGGPNFPGGEKTFFFFLGGGIKSR 
GGGFG 


10459 


24360 


A 


10540 


3 


1322 


RG YAW PNGAL PAS T VP CGFAACPGE FL C 
SVNGRQAPAGSSGFLPPVPSLCPHTVCR 
ATFQCKEDSTCISLRRFPWGSETHLLCP 
APIPSVPGVPCGTFTFQCEDRSCVKTLS 
ASPLPLLIiCPPHHLPPSPDCGLQGPSSR 
I VGGAVS S EGEWPWQASLQVRGRHI CGG 
AL I ADRWVI TAAHCFQE D S MAS T VLWT V 
FLGKVWQNSRWPGEVSFKVSRIiLLHPYH 
EEDSHDYDVALLQLDHPWRSAAVRPVC 
LPAR/ SENREQGLQ/ CSWITGWGALREG 
GP I SNALQKVDVQL I PQDLCS EAYRYQV 
TPRMLCAGYRKGKKDAC/QGESPG 


10460 


24361 


A 


10541 


154 


3 


GVGNFFFFFLRWSLTLAQAGVQWRSLGS 
LQP/LTSGSLFRQFSCLSLPSTRP 


10461 


24362 


A 


10542 


3 


406 


LMVLWTAHLPALAPGSRTSLLLAF\ALL 
CLPWLQE \ AGAVQTVPLSRLFDHAMLQA 
HHAHQIiAIDTYQEFEETYIPKDQKYSFL 
HDSHTSFCFSDSIPTPSNMEETQQKCNL 
EIiLRISLLIilKSRIiEPVQVIjTSMF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 

NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=0Lysine, L-Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \= possible 
nucleotide insertion 


10462 


24363 


A 


10543 


3 


414 


HS S AHAYGT YE S SGKRKYLQRP S RTAEL 
KKAL KE KKNRLLLQ QR S KS VT S S S S S S R 
DSSARDSSSEREETSTSSSSEDSDTDES 
SSSSSSSAYSTNSSSSFDSDSDSSS\SR 
GRSTSTDSS S ADD / S TNEE ESELE 


10463 


24364 


A 


10544 


240 


509 


TFTS F 1 1 GTPNLGE FCKCNNFG I / GKFR 
MKVACQIETLGILSFFFETESRSVTRLK 
CGGAILAHCKL 


10464 


24365 


A 


10545 


275 


2 


GCSRP FDTAI PLMGP YPEEKKLLFEKNK 
/DTCTCMFIAAHFPVAKMWIQPKCPSIN 
EWIKNPVTHIYIHTHTYMCIYIYTHICV 
CVIHIYI 


10465 


24366 


A 


10546 


233 


2 


WS WMAI QS VLPQTAFTELKNKYS CIQET 
PILFYFLKQSLPLSPSLECSGVI IAHCN 
/LQ/RPGSTTFPASASKVERTTGA 


10466 


24367 


A 


10547 


2 


420 


KKKIPNRGGCLKRDTKAFLKTPPMLQRR 
TPPAFPKGPGGFLPDFNPLPPSKGNPPI 
YSWERFFPRFFKNIWAPVPKAMLLSFFF 
FS I FWRTGSRYI AQAGLELLGS S YPPAS 
CIjPESWDSRR\DHRARPTI 


10467 


24368 


A 


10548 


190 


1 


AASTFFACVCFRLFFVFVFETES PSVAQ 
AGLQWHSHSSLQPPPNLG\SGNPPALAS 
PVAGNC 


10468 


24369 


A 


10549 


155 


1 


ARGEKYFLEIGGGFFFFLLVFFLRQSHS 
I AQAGVQ W CD \ LNS MQ P LP P RVK 


10469 


24370 


A 


10550 


223 


392 


RFVCSTIKVLRDLSSDRRNPCRFLSTLN 
/SIjAVRKDERYKAYFTK/RPDPVYDIIS 
T 


10470 


24371 


A 


10551 


183 


3 


WQKLLFLFGTESCSVARVGVQWRHLSSP 
KPPPP\GSSNS/PASASRVA\TGAHLHA 
RLIFVFL 


10471 


24372 


A 


10552 


297 


10 


TDEELLCMEEKRKWFF \DMAS I PGEDTV 
NTDEMTKT/DLEYYINIiVDKVSAGFERT 
DTNF\ERSS TVS KMLSNS I TCYRKI/F/ 
RERKSQQMWQTSFLSF 


10472 


24373 


A 


10553 


509 


3 


LLTDRTLSCRSVGVPCSVRCQCA\LLEG 
AS QLG \ S RGS G VRD P LE E AVC P FP DLQL 
RTGRTTALFKAVRQGHLSLQRLLLSF\V 
CLCPAPRGGAYRGTQASLSCGGLHPVRA 
SRLLCLPKQAWAMVGTPPP\TSLPPCSL 
ISDCCASSQRDS/VGVGPSKPGAGYNLV 
VRRF 


10473 


24374 


A 


10554 


262 


453 


GWGILTANLLVFGGGSLSISERIFFSPL 
ATGF CS VAQAGVQW \ L 1 1 AHCSLE FLGS 
SD 


10474 


24375 


A 


10555 


112 


372 


KKKGGGPPFFFFFIFFLIRQRKAKLWL 
AFNRRGLRKPD/ I FYYAGVGKI VKKKYS 
G F F I E LVTT CGKN YLMCALAV I D PRDSN 
I IRS 


10475 


24376 


A 


10556 


50 


358 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSIiY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10476 


24377 


A 


10557 


203 


350 


QLHAD KVS RFATHAE CRGV I MAY FNLNL 
PH/FKRSSCLSLWSSWDYRHAP 


10477 


24378 


A 


10558 


98 


387 


I QDTDL I RGV YR YRD KKRARE KD S P PER 
EGERERE IEKQRKKER/ ERKKERKKEST 
HQKE 


10478 


24379 


A 


10559 


237 


3 


EVFFKNIFEIESCSVTQARAQGQDHGSL 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C— Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=G!ycine, H=Histidine, I=Isoleucine, 
KHLysine, L~ Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q— Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














QPPTPG\SSSPPTIiASQSAVITGVNHHT 
QSGLTFQSPKQKALGKMCISSGG 


10479 


24380 


A 


10560 


37 


359 


RLKCGGPVIAYCKLEILGSSNTPTSTSW 
VAGTT/TCHHAQ 


10480 


24381 


A 


10561 


170 


3 


RPFFIiPWGGKKFQGVSFFFFFFLVFFEI 
GSHSVP\RLKCSGTITAHCSLDLLGSSN 


10481 


24382 


A 


10562 


1 


388 


LPEPLMVLRTANLVAMAPGSRTTLL \LA 
FALLCLPWLQEADAVQTEPLSMLYDHAM 
LQAHRAHQLDIDTYHELEETYIPEDQKH 
SLLHDSQTSFCLSDSIATPSNMEETHQK 
SNLEMVR I SLLL I E T WLE 


10482 


24383 


A 


10563 


224 


392 


CVD P I L KWF P LRTATRQG C P L S \LFNIV 
LEVLATAIRQEKE I KDP / Q I GKKE VK 


10483 


24384 


A 


10564 


185 


2 


RYFGGPDSPQNGKTVFNPLFFFFETRV\ 
HSCCPNESNGTISTHCNLCLPGSSNSPA 
SASQVA 


10484 


24385 


A 


10565 


175 


324 


APGYGTKYRVILCKILLLRRLRHKNHLN 
PGGRGCSEPRSHHC/ITPAWVTE 


10485 


24386 


A 


10566 


3 


367 


HELLWSTIKVLPYLRSDRSNPGGVISTS 
NSSLC/EKDKKNKAYFTK/RPSPVNDMI 
ST 


10486 


24387 


A 


10567 


128 


391 


GETCEGVYGPQQGGKQTWKVFHS KKDEG 
PCSKTWDVGLGSSPQFIFLKKVRFFFFF 
FETESHSVTRLECSGTPGT/HCNLCYPG 
TS 


10487 


24388 


A 


10568 


251 


485 


SFLWRFPLKGVPGHVRCQSA\LLGGASQ 
LGYS GVRD PLE DAVCLLS DL KL CAGRT T 
ALFKAVRQGHLSLQRFLLPLFSY 


10488 


24389 


A 


10569 


176 


3 


FVPPILKKMGPFLKKKEGKPFFFFFSEM 
ESRSVT\RLECSGA/TISAHCNLCLPGS 
SNS 


10489 


24390 


A 


10570 


212 


386 


RFVCSTIKVLRDLSSDRSNPGRFLFTSN 
SSLY/EKDKRNKAFFTK/RPSPVNDIIS 
T 


10490 


24391 


A 


10571 


134 


489 


SLFFLPSPRDRWSLTQSTEMPPSWKFPG 
REIRSPLLAMLHSIiLLQPTAFLCPSSTL 
LLSLKPDSLRTGSLFLSFFS FLYFEACS 
VA\RLKPSGTISAHCNLCLPSSSDSPAS 
AYIVSG 


10491 


24392 


A 


10572 


1 


470 


GQSRGIPLLLTLDIiEKPVSLLLSVTNLY 
SKNSAQFSTILQTKSFPATFTPSPSIPL 
SSAYFFFFSDRVS /LLSPRQECSGLNLG 
AHCRPQPPRFKRFSCLiSPSS\SWGLTRR 
HTTTPGL I FC I F / S VEAGFHHVAQTGLE 
LLSLSNPPASA/ FPKCWDHRR 


10492 


24393 


A 


10573 


242 


410 


VMLRMQIYFSLHNPIFFFFETGSHFIA\ 
RLECSGMIPAHCNIiCFPGSSNAPASASQ 
V 


10493 


24394 


A 


10574 


236 


1 


ARTF F I H I Ij KAS CGDNAI TG VLWHRAT W 
CPSVLLDRDSLSERVCEILCIHAC/VMR 
IHEHTHT Y I YAHTHTHTHTHTHT 


10494 


24395 


A 


10575 


241 


35 


RSSYLFIFNFFVETRSVAQPGIiKLCCSA 
WAGTPG\SSDPPTSSSQSAGI IGMSLNL 
FIiQWLI IDQAGNTN 


10495 


24396 


A 


10576 


294 


72 


FKNFKSPLFSLFFFLILGTRFCYVAQAG 
LK\ SWPQGDP S ALASRVAGTRGVHRHTQ 
LQ VS FN Y YKVLiAMHS G SQL 


10496 


24397 


A 


10577 


1200 


1625 


PDGWSYSSGHSWLPEEASNGRKAWSSCW 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine ? l^=Leucine, M=Metbionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine ? 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














VPPQLS PPQAPPS PATHLGPTPPLA/ S P 
PQAPPPPATHLGPAPPPPLSSAP\PPPD 
TLLGPAAPPP P S SGPDP PRYS P /EVP PQ 
LSPPQAPP 


10497 


24398 


A 


10578 


226 


325 


ERHTHTPHSHTRSHTRLTHTHA/YTHTH 
AHTHA/YTHTHAHTHFPS 


10498 


24399 


A 


10579 


108 


320 


TINVFLKVNVMKLSYLRFNTRNCSFFFL 
AGVQGGDHGSLQLQPPG/SSCDPPSSAS 
RVAGTTGAYQHTQL I F 


10499 


24400 


A 


10580 


1 


288 


ARGERERERERERERERERERERERERE 
REREREREREREGLSFFFSRSRGGGKLW 
CVC/RPSARPRERERHAVFSAPPSLQKK 
KKVFGGREGTHSLFYPPCGNSLRR 


10500 


24401 


A 


10581 


1 


476 


REWGLALSPRLECSWI IAHCSLGSLGL 
KRFS CLRLPS WKD YRCAP P CP S LNLFFV 
ET\GPHYVAPGWSLLLGSSD/PPPLASP 
KCRDYRHE 


10501 


24402 


A 


10582 


210 


346 


PHCS I HS QVR I QANVHKGHRQRT YGS VI 
PHILPLHVLKKT/FSLRDF 


10502 


24403 


A 


10583 


316 


461 


LPNILGWLFFVSETGFCSVTHAGVQWC 
DEHGSLQSQTFG\SSDLPTSD 


10503 


24404 


A 


10584 


172 


383 


NLDLYLTS YI KINS KWI I DLNVI AKT I K 
BLE \ KNVAGENI PDLGLGKEFKD\KTQK 
AbS \ TNKKIDKLNFTR 


10504 


24405 


A 


10585 


273 


374 


GPGTEAHTRNPSTLGGRGGAIT/R/GQE 
FKTSLAKM 


10505 


24406 


A 


10586 


150 


344 


KLKCKI 1 1 KLGIKI S FAVI VWSIGFHF 
CYLGWVQW\PIRTHPGSSDPPASASQSA 
GITSMSHHT 


10506 


24407 


A 


10587 


199 


3 


SFSYIRVHLGHIKHTRHKASCEVYANCK 
AKGI ITGIKKK/RFWPGTVAHACNPNTL 
GGRGRW I TRL V 


10507 


24408 


A 


10588 


216 


1 


PRGAPTCMRGQS AL \LGGVLQLGYTG \ V 
RDPLEEAVGP I SELKRCAGRTTALFRAV 
RQGRLSLQKFLIiPFLjV 


10508 


24409 


A 


10589 


190 


3 


I S AQP WAAFEKRF I F I YRAPA/ L.KFM I F 
FSPFKGVPFFFTFFFFFEMESRSVARLE 
CSGVISAH 


10509 


24410 


A 


10590 


369 


40 


KTERNS ININKNDVHTKTPSKGRQHQRP 
KAD KS RE MRKNQHKKAENS KNQNP S S P P 
RDHNS S PARKQNWMENEFDELTEVG/ FR 
KVGNS S ELKEHI LT P CKE T KNRTRGW 


10510 


24411 


A 


10591 


114 


398 


RFVCSTIKVLRDLS SNRSNPGRFLSTSN 
SSLC/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10511 


24412 


A 


10592 


132 


1 


ILYLWCVCVCVCVGVCVYLLILV/CKVL 
SGLGEPALSVGAFQMS 


10512 


24413 


A- 


10593 


3 


287 


IGFITLNVGLYLYLLCYIYVYFFQVNYI 
VFILVLAVFIiLKNRLLRYNTALYlSIEEVT 
REKI SHYF I / NS YIHTVGKINKND 


10513 


24414 


A 


10594 


2 


363 


RAEVGVMAPLYLLLLBSGALGLTDTWAG 
S HS LR YFS TAVS RP GRGE P RY I AVE YVD 
DTHFLRFDSDTAIPRMEPRKPWVEQEGP 
QNWE \ WTTGYAKANAQTDRVALRNLLRR 
YNQSEAGE 


10514 


24415 


A 


10595 


156 


3 


NGLELDSDDGCIALLFFYFETEFCS\VA 
QAGVQWCDLGSLKPQPLGFKPTRP 


10515 


24416 


A 


10596 


80 


306 


MDEELLFRDEQRKWFLEMESTGEVAVNI 
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nucleotide 

location 

correspond 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine 7 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














V/EMPI KDLG/ Y YRNLVDKAVPGCERVD 
FNFE I SSTGWVWWLTPAI PVL 


10516 


24417 


A 


10597 


232 


389 


ILCWVFCCCCCCCFETEFLVIQAGVHWC 
DLDSLQ/SPPPGSSDSPASAS 


10517 


24418 


A 


10598 


463 


154 


MGQGKPRFPLFFHPFYI FFFLKQGFSLS 
PRVQGRAKKVPGTLASWGQKILLPQPPR 
/RVRQENCLNPGGGGFSEPKIPPSFPAW 
GEKGGSLLKNLKKKKTKHIC 


10518 


24419 


A 


10599 


208 


38 


LGPPQGFFKTAPPFFFFFETESRSLAQV 
GVQWWDLGSLQLPPPG\SSDSPGLSKKF 
LL 


10519 


24420 


A 


10600 


200 


3 


CQPELS CMRC \ RQS LLGGLS P SGGME VR 
DPVEEAVCPLAELKRCAGRALLVRI CCS 
LQSQQAGTFK 


10520 


24421 


A 


10601 


73 


427 


RVVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPGNDIIS 
T 


10521 


24422 


A 


10602 


193 


401 


GEVSLLSPRLECNGVISAHCNFR/LPGF 
KRFSFFFIj 


10522 


24423 


A 


10603 


181 


395 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRMKAYFTK/RPSPVNDIIS 
T 


10523 


24424 


A 


10604 


384 


10 


PRRPGGECVHCLGDSLVTDRRYSGLRSS 
DQTLSGDPMEAVPWSNSLSCSHYRLKPA 
SGITWGCKQAGFFSFFLFFKMESHSV\T 
RLECSGAISVPCNLCLPGPSNSPAPGGS 
LGPRSSRPPWTT 


10524 


24425 


A 


10605 


215 


379 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLC/EKDKRNKADFTK/RPSPVNDII 


10525 


24426 


A 


10606 


1 


163 


QTHRE PAMVLS PADKTNVKAD \ WGNVDA 
HAGEYGAEALERMFLSFPTTKTYFPR 


10526 


24427 


A 


10608 


283 


2 


IQKHGLI IRCPQETPFKNNDVGMQKVKG 
RGE KSIMQIL/YF F FFLNS / HL S PR LE / 
CS GA I S AYCNLRL S HS S D S PA / VDGT I G 
ACHHTQLIFLFLVE 


10527 


24428 


A 


10609 


144 


327 


CSWSWFCGCVRVSTLIKKKNFFFEMEFR 
S\VAQAGVQGRDFGILKPPPPG\SSDSP 
ASASRV 


10528 


24429 


A 


10610 


137 


1 


HKCPSTDEWDERV/ WCMYTKEYCSALTK 
EE IMS FGKTWMELENIM I 


10529 


24430 


A 


10611 


201 


334 


SFFWRFCLRGVPSRVKCQCA\LLGGASQ 
LGYSGVRDPLEEAVCPF 


10530 


24431 


A 


10612 


146 


2 


KTPLKKGLKRVPFFFFFETESCSVTQAG 
VQWHN I GSLQVP P P \GSRHS P 


10531 


24432 


A 


10613 


205 


259 


TTMPGLSFCTRTKSNKNLSFLTLDGSAR 
VDLFLYLLLFRTIFQNF/ CGRDKVLLCC 
PGWS QT PELKQL I CLDLPKYWDHRYE PL 
CLAYPFALGQNLTKI 


10532 


24433 


A 


10614 


153 


2 


GVLMVSDGVLSMLANLFFSFLFF/LFET 
QSHS I \SRMECSGVI SAHCNLCLP 


10533 


24434 


A 


10615 


213 


8 


KPPFFFPFFFLFFFLRWGSHCWSGY/C 
KGT 1 1 VHCGLELLGS S CS P PAFQVSG I T 
GLCYCAS IF I FEE 


10534 


24435 


A 


10616 


146 


1 


I LT IRD YWVKI TLFFFFE P KFHS / VLPR 
LECSGAISAHCNLCLPASSDS 


10535 


24436 


A 


10617 


208 


2 


NNFWFSSSGKYGQYFSSETESRSVTWGH 
DLYSLQPPPP\GSSDSPASASQVAGITG 
SCHHAQLILVFIVE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 
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NO: in 
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beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=IsoIeucine, 
K=Ijysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y~ Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-possibIe 
nucleotide insertion 


10536 


24437 


A 


I06I8 


120 


2 


SFFFFFETEFRSVAQAGVQWHDLSSLQP 
PPPG\SSDSPSL 


10537 


24438 


A 


10619 


162 


1 


S FHWKLRLRGAP G CMR CHS A \ LLGG VS Q 
LGYSGFWHPLEEAVCPFSDLTLCAG 


10538 


24439 


A 


10620 


193 


419 


TFFFFETESRSVAQAGVQWRDLGSLH/S 
QPGQQSETP 


10539 


24440 


A 


10621 


313 


460 


KP GLWRGTR S Q KQ VL FFEPESHS VA \ KL 
ECGGTISAHCNFCLPRSSCFSA 


10540 


24441 


A 


10622 


243 


505 


PTGCPKQEQVQAMLRPWFFFVFETQFN 
S/VPRVECSGTISAHCNLRFPGSRDSPA 
LASRVAG I CRR / CATTAQL I FVFLiVETG 
FCHLVG 


10541 


24442 


A 


10623 


67 


430 


LARRTWKNQHSSTHGCDLFFFFFFFFFF 
LIKKGVFFFPPEGGHGHRFFFIMIKFLlSr 
YKRFFFCLFEDVLMISVGPPLAFFFFFF 
WRGGFSYLLLRCV/LTFLALGAPLFAPM 
FFLFLFGEEL 


10542 


24443 


A 


10624 


3 


207 


PGGQLGSECSGVRMDEDVL\TLKSLIIG 
ESGVGKSSLLLRFTHDTFDPELAATIDP 
S SWNPARNAATP 


10543 


24444 


A 


10626 


151 


380 


KMLFGQVQWLTP I I PALWEAEAGGSLE \ 
LRSSRPALKRSETSSPPSLPCQKKKKKK 
KFPRAGGQMLEVPFPGRVGAG 


10544 


24445 


A 


10627 


210 


3 


NFCQVKGFWKQKS S Q I KGFGKNKKKKGG 
SFFFFFFETESHSVT\RLECSGTI ivhc 
SLDFPGSSDPPTSA 


10545 


24446 


A 


10628 


237 


2 


LCFLSGASCSPYIDVYTEVNGVNPTFII 
LTNKTNTFGPLFLIIFFFGETESLSIA\ 
RLDCSGAISTHCKLCLPGSRHSP 


10546 


24447 


A 


10629 


1 


352 


RGPLS IQDYISKALFFLLLFGFLLLLLF 
FEVRSCSLTQAGVQWCNHSHISSLQSPT 
PG\RSPSDPPASAS 


10547 


24448 


A 


10630 


48 


200 


AIKPKNFLFFSETKSSSVTQAGVQWRDL 
SSPQPPPP\SSSDSPAPAN 


10548 


24449 


A 


10631 


209 


376 


SFLWKFCLKGVPGCVRCQSA\LLWGASR 
LG YLGVRD P L YE AVC P F S DL KL CAGRTT 


10549 


24450 


A 


10632 


141 


1 


AGFFFFFLFFFLETESRFVA\RL\QCSG 
AI TAHCNLCLWGSSNYHAS 


10550 


24451 


A 


10633 


18 


522 


PLYSLTKRHTPREA/ KDNLKSTQLLSVI 
DAI S EG P I E G P VDGLKS VLLNS T PVLDT 
EGNTNISGVTWFRAGEQEQTPPEGFES 
SGSETVLGTEVKYDTPITRTITSANIDR 
LRFT FGVQALVETTS KGDRNP SE VRLLV 
Q I QRNGGWVTEKD I T I KGKTTSQ YLAS V 
V 


10551 


24452 


A 


10634 


127 


2 


NLSPLFFFFETGSRSVTQAGVQWSDLGS 
LQTPPP\GSRHSPTK 


10552 


24453 


A 


10635 


336 


452 


RIFFFBFE TEALS VAPAVAQWFDLGSLQ 
SPPP\GSSESPA 


10553 


24454 


A 


10636 


259 


3 


GTRIFFPFNSSPLSFPPVQKKSVSPFFF 
SF/LFFFFETESRSVT\RLECSGAISPH 
C KLHL PG S RHS LAS ATRNVLRP LWR I HA 
SCA 


10554 


24455 


A 


10637 


96 


413 


GDNTSQHSAI L INYF I FFFTFLFF / ETG 
SHS VAQAVRLE YS S VI TAHCNFRLRGS S 
NPPKKKKKGGAVLKDPWGGQSLPGLATY 
YFFPYRGANKNLLGDFWEGPLFV 


10555 


24456 


A 


10638 


180 


473 


CYMGRKAWLVLSNLLTLKKKKKKKKKKK 



1191 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
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nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 
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USSN 
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26 
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beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I— Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKKKK / EMPVKQKNRGRALYKK 
KKGGRPFPIFWPFFFFFFLGGGKKHPGG 
FFRKKFFFGGGKKKK 


10556 


24457 


A 


10639 


312 


480 


ETVDSTSGEDDVNIAEMATKDSEDYINL 
V\GKAVAEFERTDSNFE\RGKMMSKSI 


10557 


24458 


A 


10640 


2 


516 


MKPLENLSQTASLARGATLLRPVLRRLC 
GLPGLQRPAAEMPLRARSDGAGPLYSHH 
LPTS PLQKALLAAGSAAMALYNP YRH/E 
ERPRISTSTLDLGKLQSLPEGSLGREYL 
RFLDVNRVSPDTRAPTRFVDDEELAYVI 
QRYREVHDMLHTLLGMPTNILGEIVVKW 
FEAVQ 


10558 


24459 


A 


10641 


56 


455 


PLCSYIQFIYHSKFNSNQISLVHTIPLS 
KLFGYFSQLYS I YS YLCQHF IHLLS IFI 
TVF C I VF YWVFVHL I S VPMYI CVCLCTN 
VCVCV/CCLW 


10559 


24460 


A 


10642 


272 


33 


GRWFLFLGPAKYFLTGGRFPSDFFGPLK 
INPPFFFFFFFDTESHSVAQAGVHWHNL 
GYLQ/SPPPG\SSDSPAPAEFHHTV 


10560 


24461 


A 


10643 


285 


1 


MGNFLKRFFKNEKKFFFPILRAHPFNFF 
PGKSPFKTPRRALPLGVPPKHPFFFFFF 
FSETESRSVARLECSGMISAHCNL\NLH 
LPGSGYS PAL 


10561 


24462 


A 


10644 


100 


466 


FLLKFRLREVPSRVRCQSA\LLGPTSQL 
G YSG VRD TLEE AVCP F S DLKL CARRTNT 
LFKTVRQGHLSLQRFLLPFVQLCPVPRG 
GVYRGRQAS LS CSGVHP VRAS P PLCLP K 
VPPPS 


10562 


24463 


A 


10645 


260 


460 


LKPHAERETIDKGRLPYYSFFFFETESC 
FVAQAGVQWHDRYSLPPPP\ PGSEGSRA 


10563 


24464 


A 


10646 


360 


504 


Q IGHI CAYVEKTELRL I FFFLFFFFRQS 
CSVT\RLECSGTILAHCNLCIi 


10564 


24465 


A 


10647 


241 


45 


WEVEVPRVAPCHILKRDYYFI ILFLFLF 
FEMESYSFT\ELECSSAISTHCNLCPPG 
SRDSPGIPPH 


10565 


24466 


A 


10648 


76 


462 


FLWEYQEKAHILWSLFSKLILSRSRFTV 
KEKVRQKPGILFIYLFIFEMESCSVAQA 
GVQWRDLGSLSPQKKKDS/DQSKAITFV 
EGINSKRGGWTGPHF/HCSLKMIFLI 


10566 


24467 


A 


10649 


3 


359 


QTQRE PTMELTPADKTNVKAAWGKVGAH 
AGEYGAEALERMFLSFPTTKTYFPHFDL 
SHGSAQVKGHGKKVADS LTNAEAHEDYM 
LNALTALNDLHAHKLWVDQVNFKL\LSH 
CLLVTLA 


10567 


24468 


A 


10650 


266 


3 


TPEKKKKIGGLGAPFPGYPKKIWPPNFF 
FFKKTKFFFFQRVGWAPPPKSFFFFFFF 
EVESCSVAQAGVQWRDLDSLQPSPP\GS 
RGSP 


10568 


24469 


A 


10651 


278 


461 


KKIICGFFFDTGSCFVAQAGQQWRDLRS 
LQSPPPG\SSDSLTSASRWGMSRHMRA 
VF 


10569 


24470 


A 


10652 


161 


456 


VFFFPGENIAFVENKTTLSYHFMFTRMT 
AAKKMENYKGCWGWETIGSFMPCWLESK 
RVQSLW \KVWQFFKWLNVKLL YGLAI PL 
TG I CP KKL KKYVQG Y 


10570 


24471 


A 


10653 


91 


486 


PCFNHGHTTWYMYFFSLSLILFFVFSLK 
PVILAYKTFSFPITIRVLLQRRHRQKFV 
LDGFNGVDQNNILGS ICVSLIKIFFLTE 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














S CFVAEAGMQWCNIGSLQAPRP \GS CHS 
PG1SPPVLG 


10571 


24472 


A 


10654 


267 


470 


NLHMRCEKEQNFNKLSFKDLFGLGWLWF 
FIFCFWRRILTLSPRLEC/RG/MIFAHC 
NLCLPGSGHSPASA 


10572 


24473 


A 


10655 


775 


1401 


TFPSQIRYGLRKRPNSLTLSPRLECSGA 
ISAHMQRPPPGFTPFSCLSLPSSWHYRR 
PP PRPAI C VCVCVCVCVCLVETGFHRVN 
QDGLDLLTS /S / I PPASA/ FPKCWD YRR 
E 


10573 


24474 


A 


10656 


287 


454 


LPLIYFLYFYETECHPLAQAGVQWRDLS 
SLQS PPP \ GSNDS S ASAS PTVPTSAM 


10574 


24475 


A 


10657 


212 


64 


LFLS KFYFFEMES CS VAQAALQWCDLGS 
VQPLP\PGSEDSPASAWGYLD 


10575 


24476 


A 


10658 


368 


473 


G F I DHTRRERPE CP/ LTDE W I KKM WH I H 
TTEYYSAL 


10576 


24477 


A 


10659 


358 


1 


ILAVFCFCGSAPGFEGGKLTPFFNRGRE 
TFLALVKTP PPGKARGPVF I S KNKKI PE 
FKQPPNPNPNSFFFFFFETESHTIARAG 
VQWHNLCSLQSPPPG\SSDSPGRWSLQR 
TEIAPL 


10577 


24478 


A 


10660 


2 


235 


KRDLIRHYPKEDIYMANKYILK\CSTLT 
MHTETLj IRTTMRYHL I LI KVT I FKKTDN 
MTAVGVHI CNPNTIjTGHGRG I A 


10578 


24479 


A 


10661 


9 


538 


CVTVRIPSRPTRPDSSDRSNPGRFLSTS 
NSSLY/EKDKRNKAYFTK/RPSPVNDI I 
ST 


10579 


24480 


A 


10662 


374 


38 


SFLWKLHLRGAPGCIRCQSA\LLGGVSQ 
LG YS GVRDPLE EAVCP FSDLKPRAGRTT 
TLFKAVRQGRLSLQKFLLPFVQLCPAAR 
GGVYRGRQASLSCSGLHPVRASRPLCSR 
R 


10580 


24481 


A 


10663 


268 


47 


ALIPLSSLTI SAFHLLLLETVS TSSPKM 
ECSGAITSHCSFNLPDSSPSPTSASR/V 
IGTIGARHQAQLMFIYFC 


10581 


24482 


A 


10664 


64 


451 


FSSERKSHMSLTLNQKLEMIRLSEEGLS 
KAKVGQKLVRLHQTVSQWDAKEKLLKL 
IKSATPVNIGMIKQHNLIADIEKFGMIW 
TDCQTSHRTVLCQRLIQSK/ALTLFNSM 
KAERGKEAADEKLEVRRG 


10582 


24483 


A 


10665 


355 


474 


ILFYFIFFETESCSVAQAGVQWRDLGSL 
QAPPPG\SRDSP 


10583 


24484 


A 


10666 


317 


481 


GHTCPWQTFFFFLFRDRVLLHHPGWSAV 
TQSWLTAALT\FGPKRSSCLNLLNDWD 


10584 


24485 


A 


10667 


57 


329 


VKNTQWGKDSLFNKRV\FKNWASIYRRI 
KLD \ LTS YAK INS KWI KDLNVRLE I VKV 
LQVEYPSFKILGNGSVLDFVFFYSGIFA 
LHLMGEHP 


10585 


24486 


A 


10668 


86 


468 


ENYKI PLMGGKNFLLFPS I PP YFFFFSR 
LGLTLLPRLKCSGDHCSLQPRP PGLKRS 
SCL\GFPKCWDYRNEP/CVPR 


10586 


24487 


A 


10669 


254 


25 


GSHI CKVPAAI YSNTCTS SEHGDGGWG 
GCSSGSTTHPSPVADSFFVWETVSL/S 
VTQAGVRWCDLS S LQPPPPGI 


10587 


24488 


A 


10670 


206 


3 


YGRPRKLCNIGLKSGFLDPTHWGQHRVI 
SFFF I ME TDS RS VAQAGVQ WHNLGS LHP 
PPPG\SSDSPAS 


10588 


24489 


A 


10671 


20 


355 


GFTS QS E LL YS I D P I TRP TDQ VT I G I S A 
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Predict- 
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residue of 

peptide 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T— Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, >; =Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QS YCRVHVHNRVYD LD VE SGH PNGAPA I 
KGSVCSTIKVPGDVSSGRSNPGRFVSTS 
NSSLY/EKDKKNKACFTK/RPSPVNDI I 
ST 


10589 


24490 


A 


10673 


112 


359 


NKAQQCVHENHFKL KDANTLN I KVWRN I 
CHS S PNQ KK YGLA I LNLDKSGFRSRKDT 
GDEE/HFIKIKKSVIQEDFSIINIYA 


10590 


24491 


A 


10674 


317 


2 


TGPGFFPQIWKVFPPFPLKIFFTQKFLV 
SFWGFKTIKVGPFVFFPKGPQPRFFSPF 
LGVQAEKFFFFPFFFFFFFFCERGSHSV 
A\RLECSGAISAHCNLCPPGFK 


10591 


24492 


A 


10675 


240 , 


495 


DHRPEKTKSSTCQAGEVPLLGLFVPFRS 
STDWVRPAF I / MEG/HI CPTQS TNSNIN 
PVWKHPHRHTQDNWPNAWSPHGPVKLM 
HKLI 


10592 


24493 


A 


10676 


271 


484 


NPAGQTCRIKSFFFSFFETESCSVTQTG 
VQWHDHGS MQ P Q S LGP \ S D P P T SANS S A 


10593 


24494 


A 


10677 


364 


121 


KQEWRSLSHNALCNDQASPLPGSGHWK 
SKQKLSKAAPCAGSSKHKHL/HKECWKH 
THTHTHTHIHTHTHTHIHAHTKDRFA 


10594 


24495 


A 


10678 


354 


539 


FFFFFFIVFEESHAIAQAGVQWCHLSSP 
HLR \ S P P PGFN\DS PAS AS S AG ITTLS S 
SVRL 


10595 


24496 


A 


10679 


123 


853 


RWSLCHPRLE\CSGTISAHCK/Ij/RAPG 
FTPFSCIiSLPSSWDYSARHHARLIFFVF 
LVETGFHHVSQDGLDLLT/SG/ ippala 
/ FPKGWDYRRE 


10596 


24497 


A 


10680 


251 


30 


ASLGVS APLC IHVS PCP PNGDLGKTWWV 
GGGFFFFEMEFHS /VLPRLECNGTISAQ 
CSLCLLS S SNS PAS ASRI P P 


10597 


24498 


A 


10681 


203 


1 


NFPAPGKLGPPRDSLKTAPPFFFFFETG 
SRSVA/EAAVP/W/CDLDSLQPPTLPDS 
SDSPNSASGVAGITG 


10598 


24499 


A 


10682 


237 ' 


397 


DSLTLS PRLE CNGS I SAHYNLC / RLGS S 
NSPASAS 


10599 


24500 


A 


10683 


8 


388 


LYMCWFRPGFLAHNSHDHGYSLTLSCWG 
KKKKKKKKKKDS 


10600 


24501 


A 


10684 


201 


463 


I YWED I VQGI VADRLTSRKHNS KFLGE C 
LMFGQGFLFFEMDSHSAARAGVQWRSLG 
SLRPLPPG/SQVILLPSSDSPALASRVD 
EIAGV 


10601 


24502 


A 


10685 


151 


1 


P E KNRKT QKGAP PFFFFFETGSPS VAQA 
GLQWRSHS S IQNE PP\GSSDP 


10602 


24503 


A 


10686 


165 


3 


CVFVEDIISNFFFFFMETESRSVAQAGV 
QWREHGS \ASRVAETTGVHHHAQLIFR 


10603 


24504 


A 


10687 


292 


384 


NTEIGWVWWLTTVI PALWEAEAG/EITR 
SG 


10604 


24505 


A 


10688 


161 


365 


RPPCWIRTSRWCWRYYRFDGSTIKVLRD 
LSSDRSNPGRVLSTSNSSLY/EKDKRNK 
AYFTK/RPSPVNDI I 


10605 


24506 


A 


10689 


286 


3 


QPSSWSFRSVICAPAPFSGMGLSPELQ 
Q C I VGNFAS RYTMT KS S AVL FILIFSLI 
FKLEEL/REAPASLVSFLPPQMLRIKWE 
SEQWLVLLCDRG 


10606 


24507 


A 


10690 


169 


2 


SDTPAWWPRKHVFPPGFFFFFETESRFV 
T\RLECSSTISAHCMLCLPGSSDSPASG 
S 



1194 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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Predict- 
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Amino acid sequence (A=Alanine 
C= Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GJutamine, R— Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan 7 Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


10607 


24508 


A 


10691 


240 


406 


DGIRRLSFFFFFFFETKFGFVA\RLEGR 
GT I W VNVNLHL PGS RDS P AS AS Q VAG I T 


10608 


24509 


A 


10692 


343 


3 


QVPLNLSLHLKLGMLFPGCINGSLSQFL 
QFFAQISPYQTGFPSFYKTSIYHNSLHQ 
YSLSFLPYFIFLFFFSFFFFLKQSHSVS 
\RLECSGMVSAHCNLRLTATS ttqvqvi 
LV 


10609 


24510 


A 


10693 


196 


3 


YDAGHTKKTLHFPTVYPFVLFCWWECKM 
VQ P RWK I VRR / FLKKLNTKL P YD PA I P F 
LSVHQKELRT 


10610 


24511 


A 


10694 


245 


391 


KTDYQPGAVAHFGRPRREDHPRIiGVQDK 
PDQLEKPVSTKNTKLVW/SWWL 


10611 


24512 


A 


10695 


252 


389 


KTGGKVL F F FFE TE S RS VAQ VGVQWRAL 
RSLQPP\QPGTSDCPASAS 


10612 


24513 


A 


10696 


385 


1 


P PNKAKM I S S KDNKJSrLHWGKDTLLKrKW C 
WESWIVTCITMKLDSHLSPYTKINPKWI 
KD/LKTIKILGENIKKTVLDIGLHK\NM 
SKTSKAMTKILDLIKLKSFCPAKEIISR 
VNRKS T E W KKVFAS YL S N 


10613 


24514 


A 


10697 


296 


430 


KHIQARRGGS / CGN / RQHSGRPRRADHL 
RSGVREQPEQPGEKPHL 


10614 


24515 


A 


10698 


280 


468 


DYLCL/ SLrSIYLSIYLSIYLSIIHQFIY 
HLF I YHLSNLSS I S I FFTKWLS 


10615 


24516 


A 


10699 


15 


393 


RSVGVLGPVRCQCA\LLGGDSQLG\SQG 
SGVRDPLEEAVCRFPYLQLCTGRTTALF 
KAVRQGHLS LQRLIiLiS F\VWLCPAP IGG 
AYRGRQASWSCGGLHPVRA/SMLLCLPK 
EAWAMAGAPPPASLPPLS 


10616 


24517 


A 


10700 


176 


1 


DWTTNFTTFLYNFKPSSLMPYLSHLFKT 
LR/MWPGAVAHARNPSTLGGRGGWIMRS 
GDE 


10617 


24518 


A 


10701 


494 


80 


FNKKDIHSDTPSEGHQLQRPNVETLKKM 
GRNQCKKGENPKNQNASSPKDHNSSTPR 
EQNWMKNE S DEL I E VGFRRWVI TNS SE L 
YKG\DVLTQCKEAKNLENRLGKVLTRIT 
S LEKNGHGLMEVKN I AQELCEAS AGWR 


10618 


24519 


A 


10702 


264 


410 


KKGPLFTPPGGGGG/ PQKKPPGPLNPGG 
QRDSSFFPPPGGGNTGETPPGG 


10619 


24520 


A 


10703 


125 


3 


NRGNKGQV/ QWLMP VI PALWEAEAGRS P 
EIRSRDQPRQHGETL 


10620 


24521 


A 


10704 


184 


2 


RLRVLAP CRHL P RAPRT WPKRP F F F S F F 
LFFPQTQSHSVA\RLECSGAISAHSNLC 
FPPTRP 


10621 


24522 


A 


10705 


187 


406 


LFLWKFCLRGVPGHVRCQSALXLGGASQ 
LGSSGVRDPLEEAVCLFSDLQLRAGRTT 
TLFKAVRQGHLRLQRILL 


10622 


24523 


A 


10706 


82 


410 


ILRGKFGKHYFNRIDWREALRQSLSLFN 
F I IFSNFLAS LHKPEMETELKGS FIELR 
KAIiFQLNARDASLLSTVDSDFSFCRKFS 
R/CSKCGQ 


10623 


24524 


A 


10707 


390 


2 


SKDCRTAKIAACSFLWKLRSRGAPARCK 
PELSCMRCLSA\LLGGVSQSGGTGIRDP 
LEEAVFPLAELERCVGRSAALFRASRQE 
HZjSLLKMHPQLPLPSGALSQADGSFIYK 
PLTGASAFLSEMPCQERR 


10624 


24525 


A 


10708 


229 


392 


YS WWRQMHS VAHAGVQWCD /LVSLQPQP 
PG\ SS D P PASASLAARTTGAGHHNQL I F 


10625 


24526 


A 


10709 


193 


2 


RYLCYQDHVI LLLLLLLLFLRWSLTLSL 
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nucleotide 
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Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














RLECSGTIMAHCSLDS\SSNPPISASQA 
AGTTGACYH 


10626 


24527 


A 


10710 


180 


11 


LLFFFETRS/HVSQTWQCGMISGHCNL 
RLSGSSDPPPLASRVAGTTGKHQNIWLS 
R 


10627 


24528 


A 


10711 


234 


408 


MKLIMLKVILFPQIHLQIQPNFYQNIFF 
FETELCSVAQSGVQWHNLSSLQHPPP\G 
SSD 


10628 


24529 


A 


10712 


312 


470 


TGPHCVT \RLECSGAITAHCSLDFAGLS 
TSPTSVSQVSGTTGTWMKLETI ILS 


10629 


24530 


A 


10733 


64 


392 


PKMVIRISSETSLYASLPLQMKGQRQK/ 
CEPQPKKKKKKKKKKKKKKKKKKKKKAR 
G 


10630 


24531 


A 


10714 


169 


427 


NNQKTWNKMVGVS F YLS III LISIWELNA 
PIQRHRVAEWIKKEKKKKK/DPGICGLQ 
QTLFFYEDP/HDPLRLKIGGW/RKYYPS 
RGTQIK 


10631 


24532 


A 


10715 


176 


410 


ARSS WGLHTAVP VAF FSLRFLLE I FLDR 
DTRCSPPAAFFFETGSCCVA\RIiEGRGA 
ITAQCSLNLLGSSNPPTSASRVA 


10632 


24533 


A 


10716 


234 


389 


NSGNMDRYKDVQNI IQNPI CWPGTVAHA 
CNPS TLGGRGGQ I L / RGQE FETS LA 


10633 


24534 


A 


10717 


190 


2 


GPFPPHPGGFLRGFFVPNTIPPPTFFFF 
FFLRRSLSVAQAGVQWCDLSSLQPLPPG 
P\SNSPCQ 


10634 


24535 


A 


10718 


201 


1 


WPLFLPKMFFFFKRFPHMGPSPWVGAQA 
KKKKIFFFLTG£HSVT\RIECSGTISAH 
CSLNLPGPSH 


10635 


24536 


A 


10719 


195 


419 


E YHT S L VT CGNP C V YRS NNKLNQ T SRR K 
VITKIRAELNEIETEK/LQGSGETKIWF 
FEKINKTGLiELLGS SDPPVW 


10636 


24537 


A 


10720 


218 


407 


GKKNLAFKKKKRKE KKRKKS P S QSNMNS 
AKIEARTNI KL WKHGWKNCE 1 1 DA\LQ 
KAFGDNA 


10637 


24538 


A 


10721 


231 


409 


GTQLHLGGFFFSEKELSFCFWF/MFALF 
E ME AC S VT \ RME C S GT VLAHCNIiRLPGT 
SNSSA 


10638 


24539 


A 


10722 


40 


401 


PLCPSESSGNTLMASSDPSTPAVPPPNT 
THPPLCLSKSHLPLRPKQGLPSGNLLQL 
PLTLLI PLLGAPVACWQLPQQCTLSTFF 
FETKSHPVAQAGVQWCCLGSLQPLSPG\ 
SRDSPDSAC 


10639 


24540 


A 


10723 


210 


14 


HVMGLLLLFLNKLTVNNFGWLAGFWCLG 
FFWFFSFFETRSGSVT\RLECSGMISAH 
YKLCIPGSSH 


10640 


24541 


A 


10724 


347 


462 


TFFFFFFETKSRCDIQAGVQWCDLCSLP 
PSS\PDSSDCP 


10641 


24542 


A 


10725 


169 


386 


DKKQAKT I KWGKNS FSNKWCWNNC I ATG 
KRMKLDP\YLTPYKK/INSKWIKDLSI 


10642 


24543 


A 


10726 


257 


54 


PLFFFFKRQGLTLS PRLECNGTVTAHYN 
LKLLGSRDLPTSASP\SAGITGISNPAR 
LFTYLNPTHLQT 


10643 


24544 


A 


10727 


2 


401 


NNYDRAETQ I YQYMCLNPTFYCLQETHL 
T CND I YRL KVKGRRE I MQ IENKRVGVAI 
LVSDKTDFKPTTVKKKLHYIIKGSIQPE 
DLIILCTYSPNIRASRFIKRIPDLRKEI 
A/HTVKVGDFS ISLNRLSRQNT 


10644 


24545 


A 


10728 


154 


1 


PMCSLLGLSKGGI I FFFFFEAEFCSVAQ 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Ph en yla lan ine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibfe 
nucleotide insertion 














AGVQWRDLGSLQPPPPG\SSDS 


10645 


24546 


A 


10729 


34 


151 


PRPPVPSLLDRGRLQLWRQRGLRHRAHS 
NGF I GGKQQ IMKLLKNYVRRP VG /VAVA 
IMFDPDPRYPRSWIEDDFNYGGSVASAT 
VHIRMGSLVEOTKS 


10646 


24547 


A 


10730 


2 


408 


VFLLTVRTLICRSVGVWWRSTPDLVCLG 
ISSGGCRTANIGLYQMLLPDHSSGSFCL 
RGVPGHVRCQSAXLLGGPSQLGYSRVRD 
PliEEAVCPFSDIiQIiHAGRTTTIiFKAVRQ 
VHLILQRFLLRFVWLCPAPRG/GVYR 


10647 


24548 


A 


10731 


274 


89 


ENIP I VFNPPVCS PLLWQPQE / CEYPKL 
CKLCLKLHKHRVFY\FLFFFFETEPHSV 
ARLECSG 


10648 


24549 


A 


10732 


251 
» 


1 


T FQMMQ KC F S HRK I FHNLLD KAS YKI V Y 
KEDPFSPSSLSSSVSLKNNFFFLETESC 
SVTQAGVLWCHLSSLQPPPPG\SRDSP 


10649 


24550 


A 


10733 


375 


1 


APFPPLWVRGSPPFSPLVFPHPKLKPPL 
AE I LGFFKERKWGS IRKP CLFKVKKLVS 
VWPGLELQ I FKF IGEFPFS FPSLVGRKP 
NFFLGPVFFFFFEKESRSVA\RMECKGT 
ISAHCNLHLPGP 


10650 


24551 


A 


10734 


82 


398 


S F LWKLtRLRG VP SHVR CQS A \ LLGGAS Q 
LGYLGVRDPLEEAVCPFSDLQLPAGRTT 
TLFKAVRQGHLNLQRFLLPFVQLCPASR 
GGVYRGRQAS hS CGGLHPVRAS 


10651 


24552 


A 


10735 


2 


341 


TFCAISWLESGVEDGPRSRISYRISTFF 
F F F KGAPE P KE VRGR P KPGLAPT S L P GA 
HGTTGLSGWTTPSIGGWQPPPPPRENPK 
GEHPPAPVAGDTFQ\PKKPPPILKVFFP 
K 


10652 


24553 


A 


10736 


171 


1 


rvfyllafalfvdtgs pfvsqagvqwcd 
hssiqsqtsg\s nd p pas ayrvagt tgv 


10653 


24554 


A 


10737 


190 


1 


EHKTTVRKI PKMEEMADSGSNMLRI I I F 
LFLFFETKSCSVT\RLECN/GSISAHCN 
LHLPGSSNS 


10654 


24555 


A 


10738 


252 


413 


GLLGLQNFSYKVHLTKAHLKKCSWLDAV 
AHAYNPITLGGRGEW/IQEFQTSLTNV 


10655 


24556 


A 


10739 


101 


249 


AGSTSRCI /QELSDLLEHLEQENCLNPG 
GRGCS E PQS CHCT PAWVTE TQKK 


10656 


24557 


A 


10740 


185 


403 


LGL P VHTMKSN YH YHL CHHHHHLQHHHH 
HHHCHCHYHS/HHHNSQHPPPPPPPPPP 
HHHHHHHHHLP 


10657 


24558 


A 


10741 


119 


425 


IKFHLHSMERTHLFLiWNCWAHKQRNCFK 
ADFVITDDIKQLCPPQSWWTRAGKLPLG 
AGRGGSHMLSHHFWRPRHAD ILYLGVLD 
QPGQQGETPSL/LKNTKSSW 


10658 


24559 


A 


10742 


112 


2 


GETFFFETGSSSVA\RLECSGAISAHHN 
LHLPGSSKSP 


10659 


24560 


A 


10743 


188 


3 


PPPLFFFFFFFFFKTGSGSAT\RLECTA 
HCNCCLPGSSHHPTSAYQVARTTDVCNH 
AWLIFV 


10660 


24561 


A 


10744 


176 


3 


SLYSKNKNPSHLLFLPIPIKFFFFFFET 
EFRSVAQAGVQWHDLSSLQPPPPG\SSD 
SP 


10661 


24562 


A 


10745 


250 


1 


GPRRI FFLKEFYPRFVFGKNPAPGGFFS 
GGKKPGPFPFNPRP I KFFFFFFETEFRS 
VAQAGVQWHDLSSIaQPPPPG\SSDSP 


10662 


24563 


A 


10746 


155 


3 


' PPHPFPFNPLP I KFFFFFFETE FRS VAQ 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
KHLysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














AGVQWHDLiSSQQPPPPG\SSDSP 


10663 


24564 


A 


10747 


157 


1 


TPRPFPFNPPPLKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10664 


24565 


A 


10748 


157 


1 


TPRPFPFNPHPIKFFFFFFETEFRSVAQ 
AGAQWHDLSSLQPPPPG\SSDSP 


10665 


24566 


A 


10749 


153 


2 


PPPLSWTPAPLRFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10666 


24567 


A 


10750 


157 


1 


PPPPFLFFPRPLKFFFFFFETEFRSVAQ 
AGVQWHDLS SLQPP PPG \ S SD S P 


10667 


24568 


A 


10751 


155 


3 


NTFPFFFTPLTPKFFFFFFETEFRSVAQ 
AGVQWHDFSSLQPPPPG\SSDSP 


10668 


24569 


A 


10752 


86 


406 


S FL W3CLHLRRAP G CMR CRLA \ LIjGGVS Q 
LDYSGVRDPLEEAVCPFSGPKLHAGRTT 
TLFKAVRQGHLSLQKFLLPFVQLCPAPS 
TGVQEGRQASLSFGGLHPVQSSR 


10669 


24570 


A 


10753 


364 


469 


GQFLHSbD/ SHWKKSEDFCFLWFPHSEN 
VSAIHQDH 


10670 


24571 


A 


10754 


250 


61 


GEKKKKKPRVFFFFFFFETEPGSVT\RL 
ECSGVLSAHCNLRLPNPNDSPASASRVA 
ASAKLG 


10671 


24572 


A 


10755 


66 


446 


S FL WKFCLKG VPGCVRCQS A\ LLGGAS Q 
LGYSGVRYPLEETVCPFSDLKLRAGRTT 
TLFKAVRRGHLSLQRLLPPSVCLCPAPR 
GEAYRGRQASLS CGGLHPVRASRPLCLP 
TQALAMVGAPPPGSL 


10672 


24573 


A 


10756 


252 


482 


RL PRQP VRKWVAG VRGC CVWGWS KS LQ 
RSTTLDWQGPPQRGGPILLFFFFETLCH 
/TRLEGNGEISAHCDLCLPGSN 


10673 


24574 


A 


10757 


205 


488 


PLESLRSPLGPLSWRTASSVSPVXCIYY 
I Y I Y I Y I YTHTHY/ HIHTHTHTHTHTHT 
QYI FVCWFLKDRVLLCHPRQSAGARS W 
PTRTSASQRQEM 


10674 


24575 


A 


10758 


272 


471 


YSYVLFFIFLGIESCSV/AFSAGVQWHN 
HSSLQLQTPGLKQSSHLSLP/ASASPVA 
GTTGMRYHARL I F 


10675 


24576 


A 


10759 


17 


342 


GTLSSGAQVLIGRIESIWVGLKPWALG 
GCP S P RAVHWLLAS S DWRP S LQDGAEGW 
KKGEANGNHKR/GIAVISDQIDFKTKTI 
KGD\KKSHYVMIKGPIQQEAITrrNI 


10676 


24577 


A 


10760 


42 


498 


EFRERGREKEREKERKTEERMEDRERKA 
EREREKERQEGREREREKGRQRERERKR 
ERERKRERE/RHEPGSL 


10677 


24578 


A 


10761 


1 


305 


AS WDDPAHNNNFHI PGGWAHFFFCRLI* 
DCPPWAPRPPAPR/VLL/TPAAAAAAAA 
AASRPEKKRAEENVGAIPPRKRPPRWEE 
RRGPRKGSAP PGRGAGRRAR 


10678 


24579 


A 


10762 


364 


2 


FPQKQNGQYLPLFPKPTGLCGKGKTRLG 
FFLNYFGKKKKKRGRELGFPPFCQIWGT 
S I YRGSMG.YFFFFFFWRQVSIiALLPMLE 
C\RGMIRVHCSLDLVGPSNPSTSASQIA 
RTTGTCMCHH 


10679 


24580 


A 


10763 


152 


484 


RFVCSTIiCVLRDIiSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPTODIIS 
T 


10680 


24581 


A 


10764 


208 


420 


SFLWRFPLRGVPGHVRCQSA\LLGGASQ 
LGYSGVRDPLEDAVCLLSDLKLCAGRTT 
ALFKAVRQGHLSLQR 


10681 


24582 


A 


10765 


319 


85 


GKLLNHNRCFGSKFQGIMFFFGETESPY 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H— Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T>Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, V— possible 
nucleotide insertion 














ITQSGVRWHDFGSLQPP\PPGSRDSPAS 
AS \ RG I TGARKHTQ P I FLL VS NS 


10682 


24583 


A 


10766 


122 


3 


KFFFFFETE FRS VAQAGVQ WHDL S S LQ P 
PPPG\SSDSPSL 


10683 


24584 


A 


10767 


240 


86 


RAPPFFFFEMESCS I AQAGVQWLS LGLL 
QAPPP\GSCHSPASASCELMFSKI 


10684 


24585 


A 


10768 


244 


85 


RAPPFFFFEMESCS I AQAGVQWLS LGLL 
QAPPP\GSCHSPASASCELMFPKI 


10685 


24586 


A 


10769 


35 


296 


EVKSPSARQPPRLGSEEHLRPAAAPSGR 
EVGGQPPPGQPPCPGGEG/PPPPGSPDW 
E VRSPSARQPPRLGGEPNS SLRTGHDDD 
GGFV 


10686 


24587 


A 


10770 


260 


484 


MDEELLLMNEQRTW FLEVE S S PGEDAVS 
I IQLATRDLEYDLNLVEKGAAG/LERKH 
YSFERSSTVDKILSHNTACY 


10687 


24588 


A 


10771 


248 


393 


TQEGKKLINWPGTVAHACNPSTLGARGG 
RTTRGQELETK\LANKIKPCL 


10688 


24589 


A 


10772 


239 


2 


S FLWKF CLRGVPGRVRCQSA \ LLGGAS Q 
LG\SQGSGVRDPLEEAVCPSSDLQLRAG 
RTSALFKAVRQGHLSLQRLLLSF 


10689 


24590 


A 


10773 


158 


455 


LFFTLCPSLLQHIAVMLELGLKGPKSIQ 
P I FWVFLQGTEP / HFLVTPVR/ CCLPLL 
KLFCLLVFGMESPSVPHAGECSGVISAH 
CNLCLLGSTDSSASPSRV 


10690 


24591 


A 


10774 


166 


403 


KKTFLGEPLFWGGAKKKKPGKKNPGFFP 
RG/ 1 KPRVFFSRFF FFGPP P KKGFPQKS 
FFLKSLPGFFLFGGCPPPFFFFFFFFEM 
ESCSNTRLECSGVILAHCNLCLPGSSDS 
P 


10691 


24592 


A 


10775 


208 


1 


RLCFFYFRKALLGKAQIKNIAFFPRKGS 
FFFFSETE S RS VAQAGL / LD C S GA I S AH 
CKLRFPGSRQSPAS 


10692 


24593 


A 


10776 


192 


29 


IFRKEFPCLNFFLLFFETESRSLAQAGV 
QWRDLGSLKAPPPG\SRRSPASRDTGV 


10693 


24594 


A 


10777 


275 


490 


KPGFFLLQIYLAKVYMIFFFFEMEFCS \VA 
QAGVQWHDLGSMQHRPPG\SGDSPSCLP 
SI/AGIPG 


10694 


24595 


A 


10778 


340 


3 


LKVPVGKPRSTLLGVKKVPFFGNKRKKF 
LALFFFPRPPPGEGFSTAFLAQKPTPRV 
VPALGFPKNQSPLSPFFFFFFETKSCSV 
TQAGVQWCGLRSPQPMPPG\SSDSPASC 
L 


10695 


24596 


A 


10779 


313 


1 


ANPFGGPS GGDPFS SRVFPP PGPKNETP 
F FKNKTTATKKTGNKGGWAPS PHRGGP K 
KGPALWDKKGKNLWPFFFFETESHSVT\ 
RLQCSNTILAHCNQCLPGS 


10696 


24597 


A 


10780 


116 


422 


ILEDTNIQTIETLLAIREVQIQTTLRNH 
FTLTGMAI I /RKTDNNKCWRECGKIETL 
ICCWRECKMLCNEVGTALEIVWQFLQSL 
IsTIELS YDLAMPGFI IFPRE 


10697 


24598 


A 


10781 


120 


418 


TQTTGAPQLHLASRWLSRSGGLTS SPQE 
I PKLFWS I ES PLGS S KHLSLQVFVCLFV 
CS FVFEME S CS VARLECS I VI SAH\ CTL 
HLWGS SHFHAS ASRVA 


10698 


24599 


A 


10782 


134 


I 


SSFFFLCQTESCFGVQWHDLSSLQPSTS 
RAQA\LSLPSSWDHRR 


10699 


24600 


A 


10783 


2 


435 


CSHRGDSSSYSQLSGIRAGDLGGGGKDI 
FRLLPTTLNI FAGKES YDWCVTHERMC 
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last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
KNLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop codon, /^possible 
nucleotide deletion, \— possible 
nucleotide insertion 














IiC/SCLFIYMCLCMCVHV/HYIHEACVF 
MCLCL 


10700 


24601 


A 


10784 


432 


161 


FSRAGFH/ RVSQDGFDLLPS \ CLPPLGL 
PKCWDYKREPPRPAWKLMCRQVHRKCTW 
LFIQIGSTLF KTNGGIi S AVAHAYNPNT W 
GGRGERIA 


10701 


24602 


A 


10785 


270 


442 


NVRLRLGLxALS PTLE CRGT I MAHCRLD F 
PGLMQSSHLSHRVAGTTG/TCHHAWLI F 
KFL 


10702 


24603 


A 


10786 


12 


424 


L I QLPRLE CSGAL I ARCNFT Y / S LGSGD 
PPTSASQ/VLKTTGVCHHAQL 


10703 


24604 


A 


10787 


350 


3 


DSSSVQTNKGIPGQVLSDILSCAVKASV 
LVI I PLHNALS S WL VPAL FLWRS WQVGK 
S QQAHS S VS GLC / MHYRSHTHAHTHS PH 
RHRHTYT YARAHTHTHTQMLSAYLPS KQ 
PSGSLS 


10704 


24605 


A 


10788 


155 


3 


HAFFALC I RNRLE CNGVILAHCNLC/RL 
LGSSDSPVSASQVNGIAGACHHAQL 


10705 


24606 


A 


10789 


188 


2 


RRDLSS LQPPRGGQ / MRGC I YTHTHTHT 
HTHTHTHTVHWGWGKRHHVPKGKMSANG 
ESGRAK 


10706 


24607 


A 


10790 


19 


405 


I RPT I S RVERG INS LVASEGQRL P WDG I 
ACSQGLVWQQTWGPLGPFPSLLGMPHR 
PTFRDLNSEPAPGVANVSGTLSTPLPGA 
S HGLLVF F FETE S HF VAQ AGVS WGDLR S 
LPPPPPG\SSNSPVSAS 


10707 


24608 


A 


10791 


157 


1 


KPGPFPFYPGPLKI FFFFFETEFRSVA\ 
RliECSGTISAHCNLHLPGSSDSP 


10708 


24609 


A 


10792 


157 


1 


QPRPFPFNPRPPKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10709 


24610 


A 


10793 


83 


387 


SFIiWKLHPGGAPACMRCQLA\LLGGVSQ 
LGYTGFRDLLiEEAVCPFSEIiKHHAGRTT 
AVFSAVRQGCLSLQKFLLPFVQLCPAPR 
GGVYRG/RQALLS CHRLHP V 


10710 


24611 


A 


10794 


204 


419 


KGVYGHS GS FS P P APLACFRDKTL FPVS 
LWKEFVHSPWCKCTLPQPLWKTVWRYLK 
NFKME \ I P YVPE I PLLG 


10711 


24612 


A 


10795 


207 


2 


RRGFTMLVGQNSLDPSTSRSAHLSLPKC 
WDYRCKP / PAPS / LREGFS YLLHSLLHP 
TPAPGSHHLWAALIC 


10712 


24613 


A 


10796 


2 


237 


FFFLRERILIALSPRLECSDANMSHCSL 
NIiPG / FSQS FCLSHP \ SRWDHRHMPPYP 
VKFFGI F VGDR VLALFPKL I S YX I 


10713 


24614 


A 


10797 


342 


40 


DRVFFCS PRLECSGAI I ARCS / LRTPGL 
KQSSHLSLPKCWNHRCKPPRPARCSLNE 
S HS AKKW P GQSTD S S ACEEVQ P T S P FP T 
VPYPTLIiSPSAGHGRGR 


10714 


24615 


A 


1079S 


209 


2 


CSVLAVILNPECIYWPWLELLAHDPIP 
RLVCPKS FFFETKFCSVAQAGVQWHDLG 
SLQPPPPG\SSNSP 


10715 


24616 


A 


10799 


142 


2 


I FFFFFEMESRS VARPGVQWSDLGSLPP 
PPP\GSSDSPASATPSPMQS 


10716 


24617 


A 


10800 


276 


2 


ILPIIIRNTCCCFFLKESRSCSVTQARV 
QWHI I / GS LQPQT PG \ S SNP PAS AFQ VA 
TGAHHHTSL.IRNTLI I INWEKNTHKLSG 
CTSMHFRFL 


10717 


24618 


A 


10801 


12 


364 


LHHYKTVSGIYKCLCVLNGPPTRFFPDF 
LPPLGLPYSLRHNNIEISPINNPPIASK 
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peptide 

sequence 


G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














YSRERMSCISLTLNQTLVMIKLNEESLL 
KANS DHKLCLLR / S VYQWNAYEKFL 


10718 


24619 


A 


10802 


235 


3 


HRPSEDLDFPPHWAVMRAPQYPSWDTRE 
GGPRSFLFLFLFFEMESYSVAQL/EGTI 
SAHCNLCLPSSSDSPTSRALCSL 


10719 


24620 


A 


10803 


349 


1 


NQTPFFFFFFGGTETTSTTLCS\YGLLI 
LLKYPEVA/ESASQRDPEWEAAVWRWLE 
GPGSAQPPSAPAKGQELDPWGQRPVPS 
PDDHVQWPYTNAVLLEIQRFI SVVKRTL 
TLDTLY 


10720 


24621 


A 


10804 


365 


3 


I D VCNV C VR KQ YRGFGD Q KYRAW I S P V Y 
PHKCGWHRVYNTPTPHCETEWLWVILHA 
QEH/TFSLTGRHTHTHTHTHTHTHTSKL 
APPASRALFGVAHVEAQKALAS PSSGRY 
LAITMFVQPCI 


10721 


24622 


A 


10805 


406 


96 


CPPEFSEESPRLLKFRVGGYLTPQVSKC 
GLGWRIFKVFFWSPPKVQTSLFFFSKT 
GSHSVT\RLECGDTILAHCDLCAPGSGD 
PPASATRVTVTVGLPPCPAR 


10722 


24623 


A 


10806 


58 


369 


FFFFKGDRAQNNS / WGERCLLNKGYWDI 
WI S TCKKMKS TP YLTLHTKI S S KGLKDL 
IIRAKRIHLLKKYIGINLHDLGLKXDFL 
NMTPKTLATKEKIDTLDFIKIK 


10723 


24624 


A 


10807 


126 


1 


KEPFFFFFFETESCSVAQAGVQRCNLSS 
LQHQPTG\SSDFP 


10724 


24625 


A 


10808 


204 


415 


HLGFDSLTTCISLSDGLKYKATVFLVFF 
FERESNC\AVWAECNGPISVNCNLRLPG 
SGS S PAS PSRG VE I T 


10725 


24626 


A 


10809 


168 


2 


KISKRPFFFFFFETGSHYVA\KLECSGV 
ITAHCSLDLPGSSNPPTSASWVAGTTGT 


10726 


24627 


A 


10810 


97 


389 


LAVSPSLSLSLSLSLSLSYLGLPYSLRR 
SNIEITPINTPAEGSVCSSERKGHMSLS 
FNEKIEVITLSAQDMSNTKIG/RKLDLL 
CH/TSQWNAEEKFLK 


10727 


24628 


A 


10811 


24 


416 


LEYIARRYLGWWLFFFFFLNRQGEKSR 
GPFKFFFPRGFFSTRNGAPPGGPWGPLP 
WGGGP P VGFQKQGKGGAP \ P P KKNRFPK 
GGPLTQPNLPKTPIENPKGPPTRGFFPS 
GP P P KKGAG P P P I FRVG P G 


10728 


24629 


A 


10812 


60 


435 


KKRKNFPQKKI S P YF YPLKWFKTPPLWV 
KNQTPPV/ GC FFEAPFSLKKPPRGLKKG 
/WGKNFPPPLVPQKKKKKDTAGVAILVS 
DKKDFNPTKIKKDKGHYIMVKGSMQQEG 
LGCPDAWVPS 


10729 


24630 


A 


10813 


618 


1519 


FWFGVNCKSVCVPFLPVLCFWCWCER 
GVLSQRSMGQAQS KPTSLGTMLKHFKKG 
FKGDYSVTMTPGKLRTLCEIDWPALEVG 
WPSEGSMDRSLVS KVWHKVTCKPGCPDQ 
FPYIDTWLQLV/ YRPPPY 


10730 


24631 


A 


10814 


179 


15 


KVKRLKTP FFFFFFETRTCS VAQARVQW 
HNHSSLQPQPPGPKRS\PTSVSREAGT 


10731 


24632 


A 


10815 


61 


422 


NCFFLKGPPLFFFFFFFFFFLLQIIVFI 
YKFFAFLFQMEFRSLPRLDKCNGAISAH 
CNL S Ii P S S WD YRNLP P RLANF FL FL VE T 
RQPAS A/ FLTCWD YR 


10732 


24633 


A 


10816 


170 


440 


RQGLSLLPRLECSSMTTAHCDLKLIiSSS 
NPP ISAPHIALG/LTGLCHHTQL 


10733 


24634 


A 


10817 


322 


443 


FI F I FYFFETRSHAVAQAGVQWRDLALIi 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
0=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q==Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y-Tyrosine, 
X-Unknown, *=Stop codon, /=possibJe 
nucleotide deletion, V=possible 
nucleotide insertion 














QPPPP\GSSDSPA 


10734 


24635 


A 


10818 


134 


3 


KKAHLPFFFFLEGTESHSVA\RLECSGT 
ISAHCNLCLPGSSVSV 


10735 


24636 


A 


10819 


76 


245 


FLLRKGTRQGGPLS S / LLFNI VRDVLAR 
AIRL\EKKIKDIQIGKEEAELSLFADMI 
IF 


10736 


24637 


A 


10820 


233 


440 


FFEFGALKKWKSMTHKKVERGKRTTLNG 
GGIKAAKTASLKKHLQGWGKVMLMANKL 
LRGERAW/ FPPRIRG 


10737 


24638 


A 


10821 


293 


402 


RNPFFFLETEPHSVAQGEVQWCNLSSLQ 
PPP\PGS 


10738 


24639 


A 


10822 


168 


1 


PLRGLSDSSLGIMKRMATDLSSLQPLPP 
G\SSDSPASASRVAGITDSHHHAWVIF 


10739 


24640 


A 


10823 


97 


405 


LCVNI CFNFPWLGVEWLNHI IGVGLTFF 
FEMES CSVARLECSGAISAHCNLL/LPP 
SFGSSDSPASVT 


10740 


24641 


A 


10824 


186 


2 


EPGTI SLVALKLQKWPRR/ SDHLRLAVR 
DQPGQHDETPSLLKMTKISWAQWQAPVI 
PALEPV 


10741 


24642 


A 


10825 


199 


1 


KKT Q I GGAPGGAL F F FF F FE TG F P F VA \ 
RLQCSGTNQAQCSLNLLSSSNPSFPALQ 
VTGTPSACP 


10742 


24643 


A 


10826 


252 


381 


GQQEFFFPCSVPQAGGQWCNLSSLQAPP 
PG\ SHHS PASASRVAG 


10743 


24644 


A 


10827 


77 


427 


IPQVHCPMSPPVPMACIPRVSSFTSWVF 
HNLLPPSECPLGPLVPASSHPRPCVCCR 
PCTS WS \ GPLWPRPP CSNS P V/ TCVP CL 
PCLCISEIPSCVPWP/WTYSSLCPMSHV 
PDSPCPLP 


10744 


24645 


A 


10829 


20 


518 


SFAFSLLQHLTETS FAINS CSEATLLFL 
SVFL/RAQTLTAPCQTRGPRRGKDRGSG 
SSSPSGPKATKESSVERRKSFDSWGHRF 
AA\ QRLMDNQAERE S E AG VGLQRDEDDA 
PLCEDVELQDGDLSPEEKIFLREFPRLK 
EDLKGNIDKLRALADDIDKTHKKFTKAV 


10745 


24646 


A 


10830 


351 


3 


LHFSPFLQHRQNIKVWLMRASMQRHSRT 
HGAF P LtHQDE I QTR PRLR S AAS S GP S LL 
SDHILSTLPAFTCANLSSRFCSSSPCSC 
S CLRA/ CCTCHFDLS TL IHAHTRTHTHT 
HTHTMY 


10746 


24647 


A 


10831 


8 


376 


GML PAHLADVLRHNS VGRPKHMRVMAGA 
LEGDL F I G PKAE \ E HRG 


10747 


24648 


A 


10832 


336 


38 


GVATEGVGEAAQGGEPRQPEQ/ PPPQPY 
' PPPPQQQHEEEMAAEARQA/AGAPMDDG 
FLSLDSPSYVLYSDRAEWADIDLVLQNV 
GPNPWQI I YSDKYTLWK 


10748 


24649 


A 


10833 


206 


1 


TYFFPFPPGLFFIAGIFFFFFLETGSH/ 
SSLRLECSGIITASCSLNLPDPNDPPAS 
ASQVAETTCATMY 


10749 


24650 


A 


10834 


226 


376 


RISQAI ISFYFYFLFETESRSVAQAGAQ 
WSDLSSLQPPSPGV\SDSPALPS 


10750 


24651 


A 


10835 


213 


1 


DRVLLIiSPRLGCSGMITAHCNLHLPWFK 
RFSCLCPPE/SSWDYRCPLP/PPRLTSV 
FLVETGFHHVGQAGLR 


10751 


24652 


A 


10836 


351 


447 


RENLWLTPVI PALRE AKAG I \LEPRSSR 
PAWAT 


10752 


24653 


A 


10837 


71 


472 


SASTAPMAPVKKLWKGG/K/KKKQVLK 
FTLD C I HP I EDG I MDAA/NS TNYEQFLQ 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K— JLysine, L~Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ERIKVNRKAVNLGGWTIERSKSKLTVT 
PEVPFSRRYLKYLTKKYLKNNWLCDWLC 
IVANS KESYELRYFQITQDKEEEK 


10753 


24654 


A 


10838 


37 


382 


SRCIMAFYLYGIRSFPELWKSPYLGVGP 
GHSYVSLFVAGRCGVRNQQRLCSVKTMS 
P QNTKATNV I AKAR YLRKDE GSNKQVYS 
VPHFL I AGAGKERS QMNS QS EDHK/ LA/ 
PVRNAV 


10754 


24655 


A 


10839 


313 


617 


AFFLIVFQYVARERERVSRS/VNCGLYQ 
HCYFRTL 


10755 


24656 


A 


10840 


309 


511 


WEQGSWEHVLQAPCSLIAPRITHAYTHT 
HTHTHRYTHTHLYPH/APTS IL 


10756 


24657 


A 


10841 


269 


450 


TPWPLKKEFFFFLKWSIiALVAQAGVQWR 
DLLGSSDSP\LQPSRFKQFSCISLPSSW 
DYR/PC 


10757 


24658 


A 


10842 


420 


1 


LERGENGTRRDRRKGLSHCHQPMDSVLP 
PLCHPPPLLVTMEEEITMLFIDIGSSMY 
KAGF AG / DD AS RAM F P S I VRC P WHHGVM 
VGMSQKDS YVGDEAQTKHS I LTLKYP I K 
HD I I TNE \ WDNME K I WHHT F YNKLHVAP 
CI 


10758 


24659 


A 


10843 


431 


1 


GEHSWASDLAEDVTKVTRGPLGPLFWGL 
CSGNLSFSGCVPGLPAGAVPPWVPVPFQ 
GGA/SWVWKGPSPWTLHLLLRTWGLAGG 
VGGRSLGRWRAWPGNPGSQGQGAPPAHQ 
ATGTPRSRTGS TGG I ETVT ILEGSHVS G 
MGIR 


10759 


24660 


A 


10844 


55 


297 


QRWPGTLLGVQPG/APPDSTSASGSGGA 
RGGPVPTLEGGNTGSRKWEDPCWGSQNS 
PSSDGSKPPPGPLTSKVCDDSRLSE 


10760 


24661 


A 


10845 


109 


427 


QTG P S AAGLLE FARG P LQ TL FAWVPAAV 
AAEQQ I FVNRECCCL I VPLEFCLRGVP C 
CVRCQSA\LLGGASQLG\SRGSGVRDPL 
EGGSCPFSDLQLHAARTTALFTA 


10761 


24662 


A 


10846 


17 


416 


S FLWKFCLREVPGRARCLS A\ LLGGASQ 
LGYSGVRDPIiEEAVCPFSDLQLRAGRTT 
TLFKAVRQGHLSLQRILLPFVWQCPAPI 
GGVYRDRQASLS CSGLHP IRA/ SRAAVP 
TQASAMAGAPPPDSLPPCSLSSN 


10762 


24663 


A 


10847 


265 


429 


LSGASCCLFSLPIVLFALLLLSVLSFSF 
CAWCVPSWSLSPTLFLTFVLHFL*RFIS 
CF*VNSNPKNLTFMVNFQGKGMKIGLAL 
ERIC I CGCSLVFPEAI LETSCQNLiFCTY 
ACAGVL S S VYDYLALLAVS FL FL * FFSL 
FFFFLYYLFLFALGASLPGLYLLLYFLL 
LCCISYSLF 


10763 


24664 


A 


10848 


12 


462 


QTLGTKMNEGLFATFIAPTILGIiPDALL 
I ILFPPLLI PTS * YLINNRLI ITQH*LV 
KXjTSKQMITIHNTKGRT*SLILISLIII 
I VTTNLLGLLPYS FTPTTQLS INLAMAI 
PIi*AGAEVIGFRSKIKNALAHFLPQGTP 
TPLIPILVII 


10764 


24665 


A 


10849 


2 


462 


TTLHAFGTMKREAF I TLLCLFTS ANSRG 
VYARDAHKS E AAHR I KDLAEKDFLALVIi 
IAYAQYLQQCPFEDHVKL*NEVTEFAKT 
CVADESAETCDKSLHTLFGDSLCTVATL 
RE T YGEMVD CCAKHE PVR YE C FLQHWDD 
CPNLP * WRPEVD 
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acid 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10765 


24666 


A 


10850 


164 


1 


RGFQVRGE *NSGRETER * RERE * KLARE 
RAIGQEREREKSREAGRERERERKKE 


10766 


24667 


A 


10851 


417 


100 


MMLWGNLETRLWSQAQTIPCTKQRSSL 
FFPVLNAAIFRSKSKVLNCLEYFLGNTL 
FSWDLIPEILFPLSHYHHYHHHHHNHQY 
* KS * QWSASNTRFQGILDKNS Y 


10767 


24668 


A 


10852 


339 


3 


E TTHS E ARRGRS AAAS CRGS ALRRGR F P 
E S RRGRE AAP VC P RHVLL * GAQ S KQAAV 
AGKRS GTRHAS RW P KS L F T PRRRR I S L K 
RALHF WQQ S ADP S P S VSR I PPHGVGS S 


10768 


24669 


A 


10853 


1 


373 


LQQKGMRKRAGQSEMAPAGVSLRATILC 
LLAWAGIiAARDRM YLHP FHLVI HNEST * 
EQLTKANAGKPKDPTFIPAPIQAMTSPV 
DEEALQDQLVLVAAKLDTVDKLMAAIWT 
MLAIFLGFRIYG 


10769 


24670 


A 


10854 


1 


423 


VSCSFLKLKTMKHGL.IiLLLCGFLLKSQG 
VNYTEEGFFRARGHRPLDKKREEAPSLR 
PAP P P I TGGG YRARP AKAAATQKKVER * 
APDAGGCLHADPDLWV*S PTGCQLQEAIi 
LQLERPITNTVDELNNNVEADSQTSSSF 
L 


10770 


24671 


A 


10855 


343 


3 


RGCE AHPL PRS EGPAGSALiAQ P VMYCT I 
FAGTLITAISSH*FFT*VGLEINMLAFI 
PVLTKKINPRSTEAAIKYFLTQATASII 
LVIAILFNNILSGQ*TLTNTTNQ*SSSI 
I 


10771 


24672 


A 


10856 


147 


1 


TRTPTGQCVS P KSMFLGAVAHS CNP I TL 
GG*GRRIT*GPEFDPSLANMV 


10772 


24673 


A 


10859 


189 


422 


NHTMDDFERRREL I RHKRE EMRLEAER I 
AYQRNDDDEEE AAR * RRRRARQERLRQK 
QEEESLGQVTDQVEVNAHNSVP 


10773 


24674 


A 


10860 


90 


273 


SHQE I EQNS AMAPRKRGGRG I S FMFYCL 
RNND QR YMT * RL * S G I GWMLL S S GRMG Y 
ALPG 


10774 


24675 


A 


10861 


25 


411 


AP CAKP CGD WR S RGS L VWAM S G CNARKG 
DCCSRRCGSHL*N*IPTDWPLN*FFLTS 
AKVKECFPKKES I YSQTVY* SPG*KM* T 
*DKEHPRYLIP*LCIQFYHLCWVTGTGG 
GIILKHGDEIYIAPSGV 


10775 


24676 


A 


10862 


33 


302 


SRRATLIYVDMENGEPGTRWAKDGLKL 
GS GP S MLALNGR S QVS APR FGRTFNAP P 
SLPIAT IRALGTVNRATEKFVKTNGPLR 
Q*QPRS 


10776 

- 


24677 


A 


10863 


1075 


1521 


YCHTGKGEQLiGERFCEGVSRRGPAERGS 
DSQTPWPWPLCAAAGTSAGTSHSGSSSG 
AFSSWPCWTAAEPAARKRGRPAGSWSS P 
ATGAPGRCRHRILSRGAGGSAGFVCSGL 
AESGL*ESSSPGRSQG* PQGQRHPQPNG 
LPAPPSTSV 


10777 


24678 


A 


10864 


245 


418 


TSKIiAFPISIPVIYANKVCP^FSKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKVF 
FFEGPGFFF 


10778 


24679 


A 


10865 


132 


398 


LNMGKGD P KKPRGKMAS YAF F VQT * RE E 
HKKKHPDASVNFSEFSKKCS*RWKTMSA 
KEKGKFEDMAKADKARYEREMKTYIPPK 
GETF 


10779 


24680 


A 


10866 


115 


455 


LLTRNMDRLL ILAGGMPGLGQGP PTDAP 
AVD T AEHVY I S YL AL L KML KHGRAG V P M 
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NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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NO: in 
USSN 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, X=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine ? 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EVTYIGLMLGEFDDDYTVRVIDVIAMPQS * 
TGVSAEAVDPVIQAKMSDMLKQTGKPEIVI 
D 


10780 


24681 


A 


10867 


64 


483 


QGERPAAAMKI * SLTLLS FLLLAAHVLL 
VEGKKKVKKGLYI KVDSE * KDTLGNTHI 
KQKSRPGNKGKFVTKDHTNCTWAVTEQE 
EGISLKVECTQLDHEFSWAYAGMPTLCL 
KLTDEIVYWKPVARMLRLQNDIMIYSPQ 


10781 


24682 


A 


10869 


229 


1 


DTTILKLGRLI ILQWLLSEGKSHMSFTL 
NQK*EI IKLSKECMSKASLSQKLGCFCQ 
VGSQVMHAKEMLLKEIKSAT 


10782 


24683 


A 


10870 


934 


515 


KKVLLCLiPGLiGCGGVILlAHGSLALPGSS 
SLHLSLQSSWGLTGM*YDARLIFVYFFR 
EMGS CHI CQAGLNS YNSS I PPTLVSQNV 
GTTGVSHRAQLAPVFCFCFVTLVDVKAL 
HFGKQENRIFFFFILFPGGSRDEDRVA 


10783 


24684 


A 


10871 


373 


424 


EFKTNLANMG* CLKSQHLGRPRCANHMR 
LGVQNQLGQHGE I P FLLK 


10784 


24685 


A 


10872 


I 


469 


RSRSGDSLPACDRPSGASALATCETIFS 
AISCFWDLPAPSLRLTPSCQPTMSSQIR 
QNYSTDTEAAVNSLDNWYLQASYTYLCL 
GFYFDRDDVALEGVSQFFRELAEENREG 
YDRFLKMQNQRGGRALFQD I MT PAEDD * 
GKTHNAMTAAMGLETK 


10785 


24686 


A 


10873 


244 


2 


NIYSGKETFSVPLLAIHVYHFLVGRGGS 
QGTEMLWHRVDLRYREQAGHSGSHL*SQ 
HFGS P RQ VD QLR YGVS DQRGQHGE 


10786 


24687 


A 


10874 


83 


427 


ISLNMIRIAALNASSTIEDDHE*SFTRH 
NTQTKEAQEAKAFALYH*ALDLQKHDRI 
EESAKAYHELLEAS bliREAVSSGDENEG 
L KHPGL I HjKYSTYKNLAQ LAS QREDLET 
AME 


10787 


24688 


A 


10875 


94 


1 


KSQKACNPSTLGG*GGWIT*AQEFTTSL 
ANT 


10788 


24689 


A 


10876 


2 


413 


GVTRG FNMR I E KC Y FCSGPIYP GHGMM F 
VRNDCKVFRFCKS KCHKNF KMKRNPCKV 
RWTKAFRKAAGKELTVDNSFEFEKRRNE 
P I KYQLEL WNKT I DAMNRVE E I KQKRQA 
KFIMNRL*KT*ELPKVQDIQEVQLN 


10789 


24690 


A 


10877 


65 


417 


RFAGAGAI PEARAWPTDVHAAEEEKEMD 
LPDLASRVFCGRILSMVNTDDVNAIILV 
QKNMLDRFEKTLEMLLNFNNLASARLEQ 
MS ERLLRHTRTL * DMKPDLGS LFRP IRT 
LEWKL 


10790 


24691 


A 


10878 


4 


442 


AP T P DAMGH FTE E D KVT I TGLWGKVNVE 
NAGRETLGRLLVDYPWTHRFFDS FGNLT 
SG S VI MGNP KVKAHGKNVL T S LGDAI KH 
LDDL KGT FA* LTE LH CDKLD VD P ENF KL 
LGNELETDMAIHFGQDFTPEVYAYLQNM 
VTWM 


10791 


24692 


A 


10880 


1 


419 


GKHIRQYHEEKETGQRINIHEYYLGNML 
AKNLLFEKEREAEEKEKS YE I PTKNIQG 
QMTPYYPVGMGNGTPCS*KQNRPRSSTV 
MYICHPESKHEILSVAEVTTCEYEAVIL 
TPLLCSHPKYRFRASAVND I FCQSLPG 


10792 


24693 


A 


10881 


54 


335 


RE IFTMSGALDVLQMNEEDVLKISriiAVI T 
HLCCTKTDSPMEQIIYGS*TEYIYSIKL 
NRT*QNLMLAVP*NYDI*NHDEVTVMTF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=JLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














NNTLQMFEP 


10793 


24694 


A 


10882 


216 


3 


MS IKSMS I CCLRKKGKHGFQ IRGK*NSG 
RETERERERERKRARERARGQERERQKT 
SEGGRERERERKKE 


10794 


24695 


A 


10883 


3 


390 


GELKCS KEKCQSMS AKSRT I W 1 1 GAPFL 
KGQPRGGVEEGPTVLIKAGLLEKLKEHE 
CDVNDYGDLPFADI PNDSAFRI VKNPRS 
VGKAS E Q LAGKVAE VKRS G * ISLVLCVN 
LGGGLECLCGHAKVYSR 


10795 


24696 


A 


10884 


3 


375 


STMRAWIFFLLCRAGRALAVPQQESLPD 
ETEWEETVAEVTDVSVGANPVQGEVGE 
FDDGA* E T E * D WAENP CQNHH C KHGKV 
CELDENNTLMC VCQD PTSCP AL I GEVE K 
VCSYDTLTFYSS 


10796 


24697 


A 


10885 


95 


2 


KCTQGP S AVAHACNPNTRGGRGGW I T * G 
QEF 


10797 


24698 


A 


10886 


1 


114 


MGFHH VS QDGLDLVTL * GAG FS L PKCWD 
YRREPTHLAH 


10798 


24699 


A 


10887 


1 


368 


E PTME L TTAEKTNG KAAWR KVGAHAGE Y 
GAEALERMFLTFPTT * TYFPHFDLSHGS 
AHVKGHC KKVAD AL TNAVAH VDDM PNAL 
SALNYLHAHKLRLDPVNFKLLSH*LLWT 
LSAHLHVEFT 


10799 


24700 


A 


10888 


336 


9 


FRGQVI PAI FFPFQTTSLS YFQTGQPN* 
TPFFQKILKLTRGGGKLLDFQTPGRLRQ 
KREDPLFPGV*NCN* P*SHPWWGTKQNP 
VS KKKKKKLKTS S I FVHRVSMS P I 


10800 


24701 


A 


10889 


3 


362 


GFLIiAPLEMQELGTPMNRILQLT I AEQE 
TFLTPALLLPIPHQTYSTASAVPLAKPD 
T*PKDVGILALQVHFPAQYEDQTDLENY 
N1WKALKYTEDLGQTLMGFCSVQEDINS 
LCLTVEQP 


10801 


24702 


A 


10890 


66 


333 


TLPGNIGISFVERVMEVLRPQLIRIDGR 
NYRKNPVQEQTYQHEED* *DFYQGSMEC 
ADEPCDAY* VEHTPQGFRTTLRAPS *LY 
TPIVG 


10802 


24703 


A 


10891 


405 


49 


I PWMGTERTRASAQRDVAGPARNPACPL 
PEQCFPGPAEWTGPLTPPGSC*LRPGER 
ERHGPGCWGPRERLSSGQTHPPSPQGCA 
NTCPCHKHQQELLAGGVAVFYKLFYRYG 
NSPKWDL 


10803 


24704 


A 


10892 


2 


197 


PQPLRVLWTAHLAAMAPSSRTSLLLAFA 
LI CLPWLQKAGAVQTVPLCTLFYHAMLQ 
THRAHQL V I DT YQE LE E T YI P KDHK YS F 
LHDSQTCL*FSDSIPTPSNMEETQQTSN 
L*LPDIPAPGFCPDLPALASKGWCRPNR 
SAMHAFLPRYAPNASRAPTGH 


10804 


24705 


A 


10893 


3 


334 


DQLPEPLKVLWTAHLLAMAPGSRTSLLP 
PYALL * LPWLQEAGAVQTLP VSRLFDHA 
MLQAHRAHQLAIDTYHEFDETYIP*DHK 
* S FLHDS QTS FCLPDS I PTPSNMEET 


10805 


24706 


A 


10894 


3 


341 


LLTPGVSDAICYILDSGYI IMSDTFTAY 
VIG*RFEVNG*HATVRFAAWPPVALPW 
LG V* WDNP ERGT YD GTHE WT VY F KCRHS 
TGGF F I RSNKVNLGTD F VTADKNL YVLD 
Y 


10806 


24707 


A 


10895 


25 


351 


AMIQTRDLQGGRAFGLLKAQQD*RLDEI 
CTQLLDDLKYSNDEDLPSRLEGFKEKYM 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
0=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, •"=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














E FDL I GNGHI D IMYLKLML YKLAVPQTD 
LQLPRS IGEVYSGSGETISYPDFLM 


10807 


24708 


A 


10896 


364 


38 


FS QFKI AQFLKKI P FPGFS P FFQNGGNF 
LKFSKFFPNKRGFFPHPFF*KRGPFFCP 
KWGP QGH I GAHGP P AP QGQK I P P FQ VP K 
KMGIKGPTPPPGFFFFFFFFFFSW 


10808 


24709 


A 


10897 


136 


3 


STLKHR*RPGMVAHACNPSTLGGRSGRI 
T*GQQFKTSLTNMVKP 


10809 


24710 


A 


10898 


193 


269 


GNKYCLTILLLLLLLSFKRQSLTLLPTL 
ECSG*TRTPGPKGSSLLISrPS *VAGHGGV 
CL*SQLIRRLRWEDHLSPGV*GHSDPRL 
HHCTLTWATE *DS VS * KTTTTTTTK 


10810 


24711 


A 


10899 


316 


374 


CCCCCCCCCC*CCCCCCCCCCYKETNFE 

QQ 


10811 


24712 


A 


10900 


38 


206 


VYCVLVFTICTLLCNTSLGLFHPEFFFF 
ETESCCVAQAGLQWHHC*SLRLLPHRII 


10812 


24713 


A 


10901 


364 


2 


DYQHISPEKHCRPEGSGMVYLMCRKKKK 
RKKKNLSTKIIiNPQPSFIL*KPRNSARK 
S*QKEXTKIGADSLIIENRKRIEKIYIN 
ETMSWF I E I LNKI DQ PLARLTM I ERKDS 
TKFRNERG 


10813 


24714 


A 


10902 


132 


3 


ATSSPWGYSAYRVAILLFLYFSNKLAFT 
VL YGFV* N YFLQE I 


10814 


24715 


A 


10903 


234 


352 


NFCFFETGSCSVT*AGVIMAHCSLDLPG 
SSNSLTSVSEE 


10815 


24716 


A 


10904 


250 


356 


TME MMLD KKQ I * A I F L F * FKTGD KAAET 
TLNINNAL 


10816 


24717 


A 


10905 


280 


322 


QT*SLILVSLIICIATTNLLGLLPYSFT 
PTTQLSINLAMAIPL*AGAWIGFRSKI 
KNAL VL CTSTPLPTS TMLP VHMMD T S SD 
IHPKI * S IR* QTCIQHRLLLWLKC 


10817 


24718 


A 


10906 


271 


365 


KWRPGWAHT YNPS I LGSRGGWI T * GQE 
FKT 


10818 


24719 


A 


10907 


135 


358 


LRYKLTNPKLTLLTCGLI FLKGNIVNIG 
QCNGVHM*YQHFGRPGQDCLSPGIQGQH 
GQHKESLSL*KSLKISWS 


10819 


24720 


A 


10908 


107 


212 


I KNEGMGQVRWLMP V I PALWEA* VGGTP 
EVSHSWL 


10820 


24721 


A 


10909 


165 


383 


PKNRPIIPLKRSSISNPGDFQKSLRPGL 
TPMGPHFKRGNYSYDENLENLCPVGGNK 
GPGSH* GLLTCETCRGF 


10821 


24722 


A 


10910 


119 


291 


GGYRF Y VKDTF * K* GFWPL.TLF 1 1 LKPL 
AGHGGACLWSWRLMRLK* EDHLSLGGRG 
CSELWLCTLA* ATE * DPVS * KVRLLALN 
FVYNTKTFSRAWWCMPWLATHEAEVGG 
SLESGRSRLQ 


10822 


24723 


A 


10911 


283 


2 


SDNTTDTFLPFIYKYIFPITWEQGETWK 
KNPRNSRLKKTLRSETIAQIPLKCNLWP 
GMVAHACNP YTLGD * GGWVT *GQE FETS 
LANMARPSC 


10823 


24724 


A 


10912 


1 


360 


PHAFGTMKWVTFMSLLFFLSWANSRGGF 
RQNAP KS EVAHRL KD WGKKNFKAWG * MA 
LAQNLKQGPFENHGKLGKEVPEFAKPGV 
ADDAAEKGDKWLNTLLGNYLAPVAAVRE 
TYGERAEC 


10824 


24725 


A 


10913 


270 


361 


S KT WPGTVAQTCNPRTLGS QGGW I T * GQ 
EF 


10825 


24726 


A 


10914 


157 


1 


VFFQLSRLAWKGFSRFFFVFETESHSVA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=0Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














*AAVQWRDLGSLQAPPPGFTPFSS 


10826 


24727 


A 


10915 


1 


357 


LEEWGPEREFAEEEKEKKNTKKKKIKNS 
TPPKKPARKGEEKPGPFKRAITTIFVAR 
IPLFCLKGFLWPSC*IIGRSSPTPGSKG 
HTGAPNCPPCPLAAFPKDVPNFNPKMVE 
AVKNPIL 


10827 


24728 


A 


10916 


102 


471 


PSTPIILTSSYPHYVKSSVTSTFIISLF 
P T T I FM CL YQE F 1 1 SN * HLi P T TQTTHLS 
LSFKLDYFLIIFILLTFLVTWSIIEFSL 
*YINSNPNINQFFKYLLIFLITILILLT 
ANNLFQLFIG 


10828 


24729 


A 


10917 


2 


138 


RE PTMVLS PADKTNVKAVWGKVGAHAG* 
YGAWLFTWLTLNVISD 


10829 


24730 


A 


10918 


178 


393 


LVLCTVLLMWRFHFHDSLS ILKAQDVTE 
NLSNPTVLRGETGPSAVAHTSNPSTLGG 
*GKWITQGQEFETSLA 


10830 


24731 


A 


10919 


278 


124 


TAWADWGSNETP FLLKLPKKLT * GGGVG 
LE FQVLGRVRP KNPFNLGNQGFN* PKFR 
PWTSTWGAKQNFV* KKKKKNKNKKP*NS 
RHTPPP 


10831 


24732 


A 


10920 


221 


243 


1 1 IGS PLIDI KI S KCFLKI VAFGQAQWP 
VI PAGWEAEAGRS PE VGS PRPA* PGSLK 
VHVDNNWES IN 


10832 


24733 


A 


10921 


33 


292 


GTMWLE I RHAVEVQ CVLVSELV I PTSGD 
KPEQC*DHYIiIT*YLILGKWGI*"*ISGA 
LEKKKKKKKKKKKKKKKKKKKNKIKTGG 
FI 


10833 


24734 


A 


10922 


108 


3 


KLAVYGSVCL* SQDLGRLR* EDHLNLGH 
GGCSEP 


10834 


24735 


A 


10923 


106 


2 


KLAVYGS VCL* SQLLGRLR* EDHLaNLGH 
GGCSEP 


10835 


24736 


A 


10924 


195 


2 


ASLFKSQLNRAFIYLFIYVFIFETGSL* 
PMLVCSGVITAHCNLHVLGSGDPPTSAS 
QVAETMVL 


10836 


24737 


A 


10925 


127 


2 


YVCILKNLKGRPGMVAYACNPSTLGGQG 
GWIT*GQEFKTSIi 


10837 


24738 


A 


10926 


639 


282 


FFLS IKGWVQQFMPI IPTPWGLKQEDHL 
RPGLRDQ P VQNS KT P S LLK I * KLARRGG 
ACL*SQLFRKPR*ENCLSTGELKPKGGI 
FIRLVQNKIPTTRGEKRKQSHTGSYQCP 
KIKKKG 


10838 


24739 


A 


10927 


1 


363 


ALLTQALTCRQAGAEAPHAGATPSLMPP 
SLPQGFRDCSPSDAAYTMEMTIDKKQNQ 
VIILFKFKMGHKAAQTTRNINNAFGPEI 
ANKGTVQWRFKNFCKRDESREDDE*YAQ 
PSKVATDQL 


10839 


24740 


A 


10928 


53 


379 


TEAELLTLYLLPNALLNHFTSPPLMFAD 
RRLFCTNHIDIGTLYLLFGA*AWLGTA 
liSLLIRAELCQPGNLLCNDHIYNVIVTA 
HAFVI I FFIGLP I I IGGFGN*LFPL 


10840 


24741 


A 


10929 


180 


350 


EPMA'KGKTESPGPKRCGP* I*WVISQRG 
TLRFRGAGLFFMGEFLRLGENLLEIPRG 
A 


10841 


24742 


A 


10930 


361 


379 


RRYWWLG* VQWLMPVI PALWEAKAGRS P 
KARNL 


10842 


24743 


A 


10931 


121 


543 


HRNTGSTHASAHAYHIVHTNP* PLiTGAIj 
SALLMTS S LAMG IHFHS ITLL I LGLLTN 
TLTIYH*WRDVTRESTYQGHHTPPVQKG 
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NO: of 
nucleotide 
sequence 

• 


SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQ ID 
NO: in 
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09/^15,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H— Hist i dine, I=Iso leu cine, 
K=Lysine, I/=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine> R=Arginine, S=Serine, 
T= Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LQYGI ILYIT* *VFFFAGFF*AFYHS IL 
SPTPQLGGHWSPTGITPINPLKDPLLNT 
S 


10843 


24744 


A 


10932 


227 


339 


VGGVKSVQLLLNCHFSR*MKKKKKKKKK 
KKKKKKKKKKKKK 


10844 


24745 


A 


20933 


64 


409 


DQRNKS AHLRAHL KKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKRGE L * KK I N 
WAF F L P LLKRKF VGENFL KKNF FWGGN I 
GGQ I S S KKYRGGGK 


10845 


24746 


A 


10934 


2 


362 


NKLSGPYPEKVGYTLPVLMNPIiAQLVIY 
STIFAGTLITALSSH*FFT*VGLEIDML 
AF I P VLTKKINPRS TEAAI KYFLTQATA 
SIILLiIAILFNNIIjSGQ*TITSTTSQYS 
SLIILRAM 


10846 


24747 


A 


10935 


154 


3 


GCPLSPLLFNIWKVLVRAVMQEKEIKS 
IQIGLEVK*SLLTDNMIIiYLGN 


10847 


24748 


A 


10936 


250 


363 


TAWPGYLYSFSFLY*ETTKIWLGAVAHA 
CDPSTLGGRG 


10848 


24749 


A 


10937 


199 


339 


NVLGI LLVFR 1 I VE I YKCGRLWLGT VAH 
TCNPSTLGCQRGQTT*GRE 


10849 


24750 


A 


10938 


130 


240 


KNEQDPRDL*DNDKWPNIHVIGVPEEDK 
DNGTERVFD 


10850 


24751 


A 


10939 


35 


235 


FILVENTKKMCICPLLNMIYAILYLFVP 
SVFL*EENKQ*GITEKKKKKKKKKKKKK 
KKKKKKKKI KPGGL 


10851 


24752 


A 


10940 


179 


12 


DQPGQHNKTLSLQEMKNKSS WVQCCV* P 
QKLGRLRWEDCLSPGGRSCEEPCSHSG 


10852 


24753 


A 


10941 


297 


278 


INQDNE LTL I NQS S KQ I KHKN INQTLRT 
KMNENLFA*IIAATILGLSATVQIILFP 
PILIPTSKYLINNRLITTQQ*LIKLTSK 
QM I T I HNTKGRT * S LLGGD 


10853 


24754 


A 


10942 


3 


318 


FGGGRGVRNYPDAFVLSVLLPSRLLFPH 
LFPHLFPLPPFLGLAPYCFLTL* YF* K* 
*L YYS QS 1 I FFF YREMKNKLLVFWAINL 
FFLYYYNYMEMWTLICYWKN 


10854 


24755 


A 


10943 


190 


3 


KLSHKQSVYFKTPFI*KNHVLFK*KNLW 
VYPNDKFFFFFFESLALSPRLQCNGTIL 
AHCNLR 


10855 


24756 


A 


10944 


281 


1 


KKWLFSSYSSLYGNGLFLIPPFLRVSGF 
GKGFWKKFFFLRARGALFWGSPLKGFFL 
GFFWVFFFF*DGVSLLLPRLECN"GIISA 
HCNLRFPGS 


10856 


24757 


A 


10945 


169 


319 


NGVEDAFKNMVLGWAQWVTS VI PALWE V 
KVGGS PE VRS FRPTRP I * KYGFRLGTVG 
YICNPSTLGGQGRRIT*GQEFQTNPANM 
VKPG 


10857 


24758 


A 


10946 


312 


1 


FARPGLLKSWDFQP * PLDPVWGFFFKRF 
PENWLEFEIFPFLI IMAKRKKLSKSLFP 
FPFLG* FKD*GQKTPFFFFETVSIiLLPK 
LECNGAISAHCNIiCLPGSSD 


10858 


24759 


A 


10947 


208 


329 


I FTGDRSRNNR IG * ARWLT P VI PSLWEA 
EAGGSLEPKSLR 


10859 


24760 


A 


10948 


116 


374 


FYFGYLLFFFCFFEMAPCSRG* SWSARG 
QSF CNLI FPGS S DLYVS AFRVAR I TGAR 
DHAC 1 1 1 VFWDGGCVDTI TSGLAGWLR 
RG 


10860 


24761 


A 


10949 


264 


434 


QLAFCTDTLTS VREQCEQL * KCVKARKR 
I EL CDEQ * S CRSHTD * DCTDELFDFLHA 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=GIycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, i 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














T 


10861 


24762 


A 


10950 


226 


429 


NFGFRLDKTVFFSKIRINAWAQWFMPVM 
PELWEAKAGGIT*AQELETSLSNIVRPW 
LYKKFKN*VWWC 


10862 


24763 


A 


10951 


188 


433 


YSAEWKIDLGIEVFWVGKMTYKQ*FFWF 
VFKFLFF*RQGRTIiSSKLEWMSVITGHC 
NLDLLGSRETFASAFQVTWTIGSCHS 


10863 


24764 


A 


10952 


298 


445 


L»FS KCSSKSI KEEGPGWAHACKPGTLG 
GRGGW IT* GS E FE S SLANM VK 


10864 


24765 


A 


10953 


244 


1 


KTPQGLWFCLFKKS PLKGI KGVSMGVL 
ICFFLKSRINLFNPRGLGQRKNFFFFFF 
■ E TDS YS VAQTGVQ * HNLGSLQP PS C 


10865 


24766 


A 


10954 


199 


3 


QETKKEQMKENKQIK*RSTRKKHRQGTN 
KT KE RGE RQTP P VGNRQT P T LG I HAR P R 
" RRATTSPRA 


10866 


24767 


A 


10955 


157 


3 


YSYTFSFIITTS IL X IQTLSGHGGACL * 
SQLLGRLRQDNHLNLEHGGCTRA 


10867 


24768 


A 


10956 


263 


357 


GLS LKVLiTQWLGAVAHACNP S TLGGRGG 
*IT 


10868 


24769 


A 


10957 


333 


2 


KWKADML I PLVI KS FFLGI FFKMFTNS K 
RYIFLLQLLAQHSSFFSFIRCMEL*WPY 
P I PLC YGQSNQP S V* LAYCRDL FNEL I F 
FFFFETEFHSCCPGGISAYCNLCLSC 


10869 


24770 


A 


10958 


130 


228 


GS S FLGGS P SVAQAGVHLPDHGSLQ *DK 
SPSVQKMSKWIGCSGACL*SQLLRRLRC 
QGRL S P * GQ AC S E P * S GR C T P AWATEGD 
PPKKLLPYTTDS 


10870 


24771 


A 


10959 


250 


3 


VQACSITGALYQRRNAEDEPQTARPISG 
FTTSIAMRIiMLLTCSGHIWPVADLAFSG 
IMP*GLFFPSRQNIiALSPRLECSARA 


10871 


24772 


A 


10960 


199 


326 


VLGRMWSNQKNYTLLARM*KIK* *3SJNTR 
CWGGCGATRRI IHC*QECRI *KCQPQSL 
WRTV*QFLNTLNIR 


10872 


24773 


A 


10961 


323 


3 


LLFFHLPDNWQH*YVFCHYNLSFLQFYI 
N* I IQLWFCVWLLSMS I ILKSHPCHFS 
K*LSTVTFFFRQSRSVSRLEWSGArSAH 
CNLCLPDSSDSCASAS*VAGSC 


10873 


24774 


A 


10963 


120 


3 


PFFFFFF*EMRSCFVAQARVQ*CDQSSL 
*PSTPGLKQSSC 


10874 


24775 


A 


10964 


168 


441 


LTPVI PALWEAEAGRS PEVRS SR* S PP 


10875 


24776 


A 


10965 


19 


207 


APLKLNVAMELSLGQWDVSRSNLWEIPL 
KKGDTGRVQWLTPVI PALWEAE VDRTPE 
VTSVTRC* PLKLNVAMELSLGQWDVSRS 
NLWE I P LKKGD TGRVQ WLT P V I PALWEA 
EVDRTPE VTSVTRC 


10876 


24777 


A 


10966 


209 


1 


TKSALSNMWTTHIGLCKNFCIVL*DRV 
SVTQIGVQWHDLSSLQSLSPRLKDPPTS 
AS RVAG I TGMHL V 


10877 


24778 


A 


10967 


109 


1 


DYLRSGV*DQPGQHGKTSSLLKIQKLAR 
LVGPIISC 


10878 


24779 


A 


10968 


141 


2 


GLAMLPRLENI I FRPGTVAHACNPNTLG 
GRGGQIT*GQEFKTSLSC 


10879 


24780 


A 


10969 


175 


348 


LVYECKHILCVFNTSLFFPH*TCYK*RG 
VLWLGE VAHACS PGTLGGRGGWIMASGD 
RD 


10880 


24781 


A 


10970 


108 


2 


KTKKLHMRSGWAHTCNPSTLGG * GGWI 
T*GQDSC 


10881 


24782 


A 


10971 


184 


2 


SFLWKLHLRGVPGRVRCQSAPTRGASQL 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Grycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GYLGVRDPLEEAV* LFSDLKLCAGKTTT 
LFLV 


10882 


24783 


A 


10972 


245 


339 


SQ I SGPGTVAHACNPSILGGRGE WIT * G 
QEF 


10883 


24784 


A 


10973 


295 


340 


KRVFF * ARKKNPFLSPLWGLTLENPLGF 
RLFLNPFLGLKISQFFFPRQI*PLAFFF 
WGTLPRPLNLKFGPFFFFFFFFFFF*DG 
VSFCCPGWSQARRYIEINISLSTCPRA 


10884 


24785 


A 


10974 


186 


340 


RPYLTIRFKGHPPNEPIQ*SFFVLFCYC 
FDTESRSVTRLECSGSNSAHCNL 


10885 


24786 


A 


10975 


253 


491 


FLECTSLLKRQRVHPGQHSKAPPTKELQ 
TTGRGGAIiLQSHLLRRLRQEHCLGPGVG 
S YS E P * LHHC T P AW VI EGD S S KK 


10886 


24787 


A 


10976 


141 


1 


SFSFFKFSPTGDMIGFF*HFFFFLRRSH 
S VAQAGVQWCHLS SLKPRA 


10887 


24788 


A 


10977 


3 


138 


HEETGFHLVSQDGLNLLTS * STHLGLPE 
CWDYRREPPPGREGDF 


10888 


24789 


A 


10978 


186 


323 


YESRSATQAGVQWCELGSL* PSTSRFQ* 
FFCLNLSSTWDYSGIiAPS 


10889 


24790 


A 


10979 


183 


3 


IKILFPFFFLRLSLALLPRLECSGTFSA 
HCNLCLLGSSDSAS*VAGITGTTTPSPM 
QLV 


10890 


24791 


A 


10980 


309 


55 


KNFGPIW VKFLGGKGGEMAFLGKFFFP F 
FFFFEKESWFVFQG*MQWGDFRSLQGPP 
PGVTQ I SRLGKI GGPLFPKKKKKRERGN 
M 


10891 


24792 


A 


10981 


637 


838 


SQHLGRPMRVDQLRPGV*DQPGQHGEMP 
SLLRIQR* AGHGGTHL * S Q VLRR VRQDN 
CLNSGGGGCSE 


10892 


24793 


A 


10982 


275 


2 


RNRTLKME FLiS W F WG FWNWLLNM I RP KN 
VKDS TS KSMENTDS PWHELFKELGKINA 
FDTPDSLIiVRGKFSDSIHNTFDHM*RTK 
EYNEARA 


10893 


24794 


A 


10983 


98 


346 


GHGHAT LRGLC VLT FS FH I TALS VSGTN 
DAEDCCLCETQKP I CGYIERNLLYLLIK 
D VCRVPAW* WERVYSL I S RYS LWRD 


10894 


24795 


A 


10984 


30 


410 


LPEFTGRPKRTRTRGFSTNHTDIGTLYL 
IiFGA* AG WGTAVS IiL I RAELGQ PGNLL 
GNDHM YNAI VT AHAFE 1 1 F F I VL P 1 1 1 G 
GFGN*LVPLIIGAPDMAFPRINNICF*L 
LPPSLLLLLASAIAE 


10895 


24796 


A 


10985 


141 


360 


QTLRT KMNENLFAS F I APT I LGL PAAVL 
IILFPPLLIPTSKYLINNRLITTQQ*LI 
KLTS KQM I T I HNT KGRT 


10896 


24797 


A 


10986 


3 


347 


HELRTKVNEHLIASFMGPTSLGLPAALL 
IIL*PPLLIPTSKYLISNRLITTQQ*LI 
KLTSNQMITIHNTKGRT*SLILESLII I 
IATTNLLGLLPYSFTPTTQLS INLAMAI 
PL 


10897 


24798 


A 


10987 


46 


317 


KSMTPIRKINPLIKLINHSLIDLPTPSN 
ISA**NFGSLLGACLILQITTGLFLAMH 
YSPDA*TAFSSIAHITRDVNYG* IIRYL 
HANGAS 


10898 


24799 


A 


10988 


245 


1 


VSCLLEGKIiTNRKDIHTKNPSVHHHHQR 
PKVDKTTKMGKKQNRKTGNSKKQTASPP 
P KK* S S S PATEQ S WMEND FD E LRE E 


10899 


24800 


A 


10989 


255 


1 


S CLPWANWI SVQNQHKKTFLVIRTFFPI 
GS FHVT * LLGPPL ILI SPPRI FFFFETG 
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C=Cysteme, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=GIycine, BMHistidine, I=Isoleucine, 
K=Lysine, L==Leucine, M=Methionine, 
N^Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown ? -—Stop codon, /==possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














SFTVARAGVHSGAISARCNLRLLGSSNS 
A 


10900 


24801 


A 


10990 


166 


307 


GEKPWGPLIWGGLVGRSSWASCQVLELL 
* EMVFRHVAQAGLKLLS S SNP PASAS Q S 
AGITGVSH 


10901 


24802 


A 


10991 


109 


330 


TNQFKTKKEREAGKKKVKELERERERGK 
KRKEQRKNEKERR*P*NMD*RRIRERNH 
SFDVYEFLICGLLNLLHV 


10902 


24803 


A 


10992 


244 


326 


grylkgwldgpa*awlgtalsllirpe 
lghpaillgidlissvivtahawsil.lv 

RVI Pit IAGFGN* LVPLI I GAPDMAFPR 
INSISF*LLPPSLLLLLPPPI*QMRAGV 
GEREVRVR 


10903 


24804 


A 


10993 


101 


2 


KTFWARFVGTCL*SQLLRRPRQEDHLRL 
GGRGC 


10904 


24805 


A 


10994 


177 


1 


TPSLLFFVNI ICFCLNPGGGGYSEQKLC 
LCTPAWVTE *NS I SRPSQNNSRQSRQVN 
HLF 


10905 


24806 


A 


10995 


214 


326 


KEERKNPRAIRVV* PWGFFFPLFKEISL 
FLFVFRFWHGYPFPGAFFSTQKWLVF* T 
MVLFFFFFS *VSLLPRLECSG VI SAHCN 
TFLPGSSDC 


10906 


24807 


A 


10996 


184 


379 


LCGM V YHFP SILLLYL* CKL F F S P KL E F 
S S C C P LE CNG T VLAHR S LRL P ASGD S P T 
SASRVAGHGGTCL 


10907 


24808 


A 


10997 


136 


3 


GRVDGQHIMTHQSHAYHIVKPSP*PLTG 
ALSALLMTSGBAM* FH 


10908 


24809 


A 


10998 


274 


363 


PWAYAE P LTLHDAT YS CT * S L I RAS FTL 
F J A * TTRLRLLP * S FTP TTQL CMNLAMA 
IPL*AVAPDIGYSSMITNALSHLLPQCT 
PTPLISILGIIETVSLLIQPITLGVRRT 
ANIT*CHLLMHLIGSATLSISTI 


10909 


24810 


A 


10999 


305 


1 


NLHTTPSQIPLT* PLPNINIHNLHLTTT 
Q * I K YL I L I P SNLFHNKPJNTL KTAYS VP P 
S PNLHLPS S SNS PASAS QVAGNAGARHY 
AWL I FVFLVE TGYS LLV 


10910 


24811 


A 


11000 


154 


2 


IWGILQSRRCGSRL*SQHFGRPRQADC 
FTPGVPDQEYFTLLPEWFFLV 


10911 


24812 


A 


11001 


142 


367 


GVCLYQRI S I YI KKTKKLSQGLF YFLLL 
LLLF * EGGSHS VTKLECSGGVS AHCNLC 
LLASSHPPTSSSQVAGTTG 


10912 


24813 


A 


11002 


60 


385 


GNTLL SHTLRAHL I ENLHAE F I DRT I LG 
LPAA*LIILFPPLLIPTF*YLINIRLMA 
NRH*LIKLTSERMITIRNSIGRT*SIIL 
VSLI I T I ATAN LLGLLP YS I TRTT 


10913 


24814 


A 


11003 


214 


83 


S KKS AS P LTFN I VF Q VLDNS VRE EE I R Y 
TEMEKEE I KLS * FVD 


10914 


24815 


A 


11004 


241 


368 


S YLL * PS P * PLTGALS ALLMTSGLPMRF 
HFHS I TLL I LGL LTNTLT I YQ * WRDVTR 
ESTYQGHHTPPVQKGLRYGIILFITLEV 
F FFAR F F * AF YH Y S LAPT P Q L * GHWP LT 
GINPLNA 


10915 . 


24816 


A 


11005 


3 


396 


HEPHALGMPLTADLPSMASCSQTSLLLL 
LHLLHLPWI * EARAYQAAGCS KLFDHAM 
LQAHRAHQLT IDTYQE VEET Y I PEDHKY 
SFLHDFQTSFCFSDSIPTPSNTEETYQK 
SNLELLRISLLLIESWLEP 


10916 


24817 


A 


11006 


156 


335 


FGCCCCFLFF * LVYCCYVMMLYYI YVL I 
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E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
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T=Threonine, V— Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SFGFYF*YWIM*YFYRLVYNIYLVLFYL 
LiYMS 


10917 


24818 


A 


11007 


107 


2 


KCS S S KHFTKEDS Q ITNKH I E KCSS * LL 
VREMQII 


10918 


24819 


A 


11008 


129 


412 


KTRSGFKKTPGQKGKTPLLKKTPKITGP 
GGGALKPPFSGGFRPENPLNLRDKGSSG 
PKTPTFFSPWGKK*NFFQKKKKNLLYPG 
KIFFFWMKNI 


10919 


24820 


A 


11009 


63 


268 


CTFKFKNHCS T I Q S TT * LDRS EKKKKKK 
KKKKKKKKKKKKKGGPL* KNFSLKGG 


10920 


24821 


A 


11010 


154 


422 


NEFSFFFFFWKPKPPFFAQVGGQYRNWG 
*LKPRPPWLKNFSGFASGEAGITGGVPH 
PG* I LVF * VKAGFH I ENLVWNS ETGNQ P 
ACASQ 


10921 


24822 


A 


11011 


23 


375 


TRYLTKIKKKKKKKKKKKKKKRGAPFLK 
TPWGAPFFPGSAIFIFFFFGGFF*TPLG 
FFWETLFFWGGKILGHLSPKILPFWGKK 
NFFWVKGGKNLLNSPFLKIFFLGVFFKK 
FFPPG 


10922 


24823 


A 


11012 


151 


361 


ACTTTPSSFFSNIDTVSLCCPG*VPGHN 
NSSLINFPSFLRSWYYRHAPPHPAHFFP 
ILIRSRYVAQAGLEFLALRDTPISDCPC 
TVHYPPSLAN*SVFFFFLTQSFVSLPRL 
KRK 


10923 


24824 


A 


11013 


146 


378 


NFWFS I EFLH ICY* I YS F I CI, I KVFL * 
* KKKKKKKKKKKKKKKKKKKGENLKGAG 
GGKKFLGGGKNNS FFRNGGCF 


10924 


24825 


A 


11014 


82 


264 


TELFLYTICNWSAILYILC*RNKVFLSF 
INTEKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKIERKI 


10925 


24826 


A 


11015 


381 


2 


GRTALFTLLKGPIHRPQMFFTPLRVFPG 
SYTGISIRQSGFFFIKRVIFFRIj**KALi 
FLLPFLLPFFSFFLKIGTYSVAQECNGM 
I I DN * NFE LLGRRNP S T S AS Q I AGNTS A 
Y*HTQLVFLCEDG I 


10926 


24827 


A 


11016 


1 


251 


PNC I VHS LRTGTLWFI ILLLLCFTFE * F 
LNKNL I FESQKKKKKKKKKKKKKKKKKR 
GAPLKKTPGGPKFYPASKKKISPQKGG 


10927 


24828 


A 


11017 


234 


1 


DINIKGSWVKGIPKPSVLSLQLFCNSKI 
FPN* KL I KKKNPNE PGVMVHVCNPSTS G 
GRNGQMA* GQ E F KAS LANMAKP 


10928 


24829 


A 


11018 


107 


342 


FQLCIGAIVHCFLFMKQHSEYKSTDHRA 
SSKCHTVQGSSQLSTSAFKQK*THKKKK 
KKKKKKKKKKKKKK * KKKGGAS F 


10929 


24830 


A 


11019 


264 


1 


CFKPLLFKYPGI IKAVNLPVSTASATFH 
KF* YVTFS FS * KKFLCVGGKTGSHS ITQ 
AECSGTIKAHCSLELPGPGDTPTPTSRV 
AARA 


10930 


24831 


A 


11020 


1 


288 


RTRGTSSRSRAATLFFFFPFKKGGFFNG 
NLPTTFGKLGNKKFPLNLFPGGPPLPHG 
GLKKGPGAPQVPPLFLKKPQTSPP*GLR 
EGPPPLKKFYLF 


10931 


24832 


A 


11021 


115 


287 


S TCE I S FGCSRGCCWDYLMVWEFSHFYF 
*FLETECCSVA*AGVQWSNLSSLQSLHP 
E 


10932 


24833 


A 


11022 


186 


1 


RPKVDKTAKM3RDQSRKA*NFKNQKASS 
PPKEYNSSLAREQNWMENEFELTEVGFR 
KSVITN 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 


10933 


24834 


A 


11023 


180 


353 


I FCFCI FSRDGVS VC* PGWSQSPELMIH 
*NTKIS*AWQRVPKIPATQEAEAGESLE 
PG 


10934 


24835 


A 


11024 


155 


1 


PQARRGGTCLQS * LLRRLRLRQEDRLS P 
GV*GYSEL*SCPCTPAWETERLV 


10935 


24836 


A 


11025 


22 


325 


KLILLYDPQLTFIFLNTRAWAHCAFCC 
WPIHFVFGNYGFVFSCLFTFIFVYSGKA 
DLFFKPKKKKKKKKKKNKKKKKK* KKGG 
RGLKNSWGGPIFWGFPK 


10936 


24837 


A 


11027 


127 


259 


GQVQWLTPVIPTLWEAMVGGSTKLRSSR 
PFFSTP*PCIVMILYG 


10937 


24838 , 


A 


11028 


37 


408 


I ASGAL F F F FKAAG KRGDF KTEGAYQRQ 
RT I FKNKKRARRE KTGRENLRGNYKNMG 
RGLKTPRGALGGPYLDKKGPLRGKGPFQ 
GGTLLGGGPKIKIQGTL*IRRNYWPYIR 
RYNRFKKRQKNM 


10938 


24839 


A 


11029 


146 


3 


LLVLKDLPRIMVRSAPTYIYIYTYLYIY 
IYIYTHTHTHTFIT*VLFC 


10939 


24840 


A 


11030 


268 


435 


LMCLRNETN* F * KVE * WLGAGAHTCNPK 
TLGG*GGRIA*GQEFETRLRNIVRP 


10940 


24841 


A 


11031 


322 


2 


KLPHLQQQCVKWCYKPFLKNGKIGHGG 
VCL*SQPLRRLRQENHSNPGQCTSAWVT 
QRDSVS KNKKKRKMEKYFSSVRQNKPVK 
HTKRMS PTKADTKS THCGGGRV 


10941 


24842 


A 


11032 


251 


380 


HKRLHTI YYFVPLKKKMGPGLVGHTYNP 
S TLGGQGGQ I T*GRE 


10942 


24843 


A 


11033 


64 


324 


FWFFFFFFFFFLNPPGGGKKKKKPPPPL 
GKKKKRGKKGGEKFFSPPPRGGGGGPKK 
ISLKRVLKFF*NPPGGTKKGGGPFLIPP 
GSP 


10943 


24844 


A 


11034 


420 


534 


CPGWAHAYNPSTLGS *GRQITIDREIE 
T I LANMAKPH 


10944 


24845 


A 


11035 


196 


1 


KNGHSLPPGPQKGTPFPKKKKTPKINLS 
ELGKSKAYSWPGTWHACNPSTLGG*GG 
WIN*GQEFE 


10945 


24846 


A 


11036 


10 


248 


P S DR * L FS TNHKD I GTL YLiL FGA* AG VL 
GTALS LL I RAELGQPGNLLGNDH I YNVI 
VTAHAFVI I F 


10946 


24847 


A 


11037 


32 


405 


D YVS KR KE KREKKRNV I LETS I S S HLVE 
WMLCSRYHPLIKKRSSVRILIALLCPQK 
VRDMS *GIGSKKQCWDSWIiSIWIKMESD 
PFLIPYPKINA*RLKDLCERINLKI IIK 
REYLHDYRVKKV 


10947 


24848 


A 


11038 


337 


448 


KNPRGFFGKNPFFWGGPFGGPPPPKKMG 
KKKTG 














10948 


24849 


A 


11039 


307 


415 


RFVCSTIKVLRDLSSDRSNPGRVLSTSN 
SSLK*KKK 


10949 


24850 


A 


11040 


213 


403 


VHRG I KY Y FLNKL P D F PRQT F RK* G * RL 
GAVAHACDPSTLGG*GGWITLGQTFETS 
LTRMGKS 


10950 


24851 


A 


11041 


104 


2 


SAFFF * ETES CS VAQAGVQ WCDLS S LQL 
PPPGFK 


10951 


24852 


A 


11042 


177 


3 


AYTAYICVCIHIYLYK*MYICINVHVCK 
ITYLYIYKYLKYNAIYLHIGFFSFLHTI 
S 


10952 


24853 


A 


11043 


314 


395 


TVYILNLS INS VQS VLLS VFHRLRHQPR 
GRVQWLTPVI PALWEAQAGGS P * VR I SQ 
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IKTPA 


10953 


24854 


A 


11044 


310 


403 


RGKGGKKPRNPGGGKS *ALTKKKKKKKK 
KKKKKKKKPGGALKKKPRGGQKKTGGGK 
KKFL P KRGAKKKP P GD FGKKNQ I GGGE K 
RGKTPP KKKKP * GKKKNLKGKGGKKTP K 
SGGGKKFPPRW* KKKLPPGG 


10954 


24855 


A 


11046 


181 


3 


EGPLSLSLLFFFF*RQISSLLLRLKYSG 
TI IAHCSFKFPSSRNPPTSASQAASTTG 
VHH 


10955 


24856 


A 


11047 


392 


41 


KESRSLSQGGREKGPFKFLAPPAPKFKR 
IFRPHPLKKLGPKGPPPSPS* ILSLKKK 
GGPPILARWFWNSCPQGFPPPGPPKRLG 
FKGGPPPPPLF*KKKPPFLGGENQKVKN 
FFFFF 


10956 


24857 


A 


11048 


147 


307 


TVI FVFLLRKGLTLLPRLESGMIMAHLK 
S* PPGLKQSSCHSLLSSWDYRRRRG 


10957 


24858 


A 


11049 


118 


1 


HGKIPSPLQ*HK* IAGYGSACLQSQLLR 
RLRWESRSISI 


10958 


24859 


A 


11050 


54 


284 


RKIRRGGLHLWSNLLGRLRWEDLLGRLR 
WEDRLSPRRGGCSEPRLCHCPPAWATE* 
D P VSKKKKKS LFVE KPQGGG 


10959 


24860 


A 


11051 


130 


340 


HNMHFAAHGSRINFDFFFFFFFFERGFP 
FCPPVGRAGTHFGLLEPFPSRV* KQ YYW 
LGTVAHVCNPSTLGGHVGES*GQEFKIS 
LANIVK 


10960 


24861 


A 


11052 


140 


337 


NIIMFFFFFFETVSFLLLGLEGKGTIWV 
N*NLCLPG*GDSPLGCS 


10961 


24862 


A 


11053 


197 


428 


IRS INNP TV AS QYS S E WKS H I PL I LNQN 
LDMIKFSEEGMLKAKI GLLRQTVGQVVN 
AKEKFLKKNQSATPLS I * IRK 


10962 


24863 


A 


11055 


180 


1 


AI IALLYSRLGDYARRFHLRRKKEQWQG 
MVTHVYNPSTLGGQGGRTA*TQEFKSSL . 
GNIA 


10963 


24864 


A 


11056 


158 


426 


FFPLPLPLPTVSLFPRSPSDAEPKLDCT 
AAI SAHCNLPA* FS CLSLPSACNCRRAP 
PRLiTASASRGAGIADGVS FTQCSMVPRL 
ECSGV 


10964 


24865 


A 


11057 


220 


421 


YAIiHLNMK3SINSYF*MRKKKKKKKKKKKK 
KKKKKKKKQNKKDPGGAVYKKI P 


10965 


24866 


A 


11058 


162 


464 


SCSVGLKLFSMKTSLALSYYCLLLLLTL 
H YY YY Y YY YY * SLVFM 


10966 


24867 


A 


11059 


1 


134 


APENRVD PRVRKTLVPL I LP 1 1 TTLANP 
CKKD* YP YYVKIS IAC 


10967 


24868 


A 


11060 


382 


273 


S S CL* PQLLRRLRQED YLS *GA* GCNEL 
**CHCSPAWVTEQDPVSKNIHTYIHSQS 
INYGSM I G Y IHEQKGI I ADHKP 1 1 AE PE 
DSTYPRDHT 


10968 


24869 


A 


11061 


298 


378 


SCSASPCCSSRHGWSPCG*QTCLLLAIA 
LRCLPWLREAGALHTGPGCRLFDHAMLQ 
AHKAHQLVIDTYQEIGENYIPQDQKNSF 
LLESHTSFCFSDSITTPSNMEETQQKSN 
LKLLRISLLLIETWLEPVRFLTRIVANN 


10969 


24870 


A 


11062 


39 


486 


RPTRPDAYHIVKPSP* PLTGALSALLMT 
S GLGMR FHFHS ITLL I LGLLTNTLT I YQ 
*WRDVTRESTYQGHHTPPGQKGLQYRIM 
LFITSEVFFFAGFF*AFYHSSLAPTPQL 
GGHWPPTGITPLNPLEVPLLNTFGILAI 
GGS IT*AHH 
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C=Cysteine, D=Aspartic Acid, 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro\ine 9 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
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10970 


24871 


A 


11063 


269 


429 


ATWQKL PLQ I QKWYPGLGLVAHACNPS T 
LGGQGGRIT*GQEFLNQPANMVNTH 


10971 


24872 


A 


11064 


248 


65 


I IEGQAQWLMPVI PALWKAEAGGSPEVG 
SSKPA* PCDPPASASQSAGITGVSHRTQ 
L.ILNS 


10972 


24873 


A 


11065 


176 


1 


LFPPPKKVPPWGNTQFKS I CLYTTNFFF 
FETESRSVARLECRGTISPHCKF*LPGS 
RE 


10973 


24874 


A 


11066 


2 


397 


MVLWTAHLRARAPGSRTWGLLTFALLCL 
P WLQEAGAVQTVP I SRLCDHAMLQAHRA 
HQLAIDTYQEFKETYIPKDHKYSLLNDS 
QTSFCDSDSIPTPYNMEETQQKTNLEVL 
RMSLLEIDS *LRPAQSNKR 


10974 


24875 


A 


11067 


172 


399 


Q I LYL * KYAYTVTMQHLSHP IL* KKKKK 
KKKKKKKKKKKKKKKKKKKKTGG 


10975 


24876 


A 


11068 


192 


3 


PKXTPMLKLIVPHIILLPLT*LSKKHII 
* INTTTHS I I I S I I PLLFFNQ I TSNLFS 
CSPTFC 


10976 


24877 


A 


11069 


204 


345 


DKSGEHSKTSS IQKNLRLGAVTHTCNPR 
TLGGQGRWTT*SQEFQTSP 


10977 


24878 


A 


11070 


3 


416 


HELPQPLRVLWTAHLGATAAGSRTSLLL 
DFALLCLPWVQEDGAGQTVP*SRLFDHA 
MLQAHRSHRLGIDTYQEWETYIPEDRK 
LS FPDDCHT YFCF * HS I PTPSHLGETLL 
TSNLELLRICLVLIDSWLEPARILTS 


10978 


24879 


A 


11071 


108 


2 


PSPFFFETESCSVT*AGVLWRDLCSLQA 
PLiPGSSC 


10979 


24880 


A 


11072 


400 


278 


KVK * LRQ ENQWNS RGRDC S E PRS RHGTL 
PWTTEQDS I PKNKTKPYKI S 


10980 


24881 


A 


11073 


139 


3 


KLAGCGRAHLSFQLLRRLRREDHLSPGG 
*GYSEPWSRYCTPASRA 


10981 


24882 


A 


11074 


17 


421 


DHVIGQPRWLMPVILALWEAEPGGLLEP 
KVQDQPGQNCKSLSLLTIKKKRRENFPG 
RGGTQLCSQLLRRLRVGCCIDPKLHNCL 
PAWMTERDPTFKRE * KKNDVLEAKNFFT 
I IjT FMNT T KE KMVAQR I GS L FHG 


10982 


24883 


A 


11075 


412 


1 


QFFFLGVCCPTQ I FGVGKKGE KKKKKRG 
FPPRVFFFFNPQIFSPFFFPGPFFFLGK 
FPPFFGERFLFFSKSKFFFPGVFKRGFP 
FPPKNFFFF*RIF*KFWFSPPKPFFFFF 
FFF*DRASLCHPGWSTVAQS*LTSC 


10983 


24884 


A 


11076 


279 


410 


VFCCVAGRDGAPGVKAHFPAAQKPWDGG 
FPGP * S AGVGPP P I PR 


10984 


24885 


A 


11077 


12 


386 


I AHLLLFS F YHKDTGTL YLL FGAGAGVL 
STALSLIilRAELGQPGILVGNDHIYNVI 
VTAHAFGI I FFI I I P I I IGCYGN* LATL 
1 1 GAPDMAFPRINNISFGLLPTSLLLLL 
AYAIVDAGARTGW 


10985 


24886 


A 


11078 


412 


3 


LAILSLSLSLGFPYSLRYKNIKIRPINN 
PTMAS KCS S KRKTRTS FTLNQNLEMLKL 
I E KSTAKAKRLKVRPL. VP VS QWNAEEK 
FLKEIKSAIPLNIQMIR*QTSLIADMEK 
VSVL* IKDPTSYNIPLSQS* IHSK 


10986 


24887 


A 


11079 


78 


443 


RHRLPSDTCBTAAVP * PCKGSKKKKKKK 
KKKKKKKKKKKKKKKKKKKGGGGFKKNS 
GGGQNFAGGKKIFFFFFGGVFKNPLWIF 
WKKTFFWGGKISGQLSQKKLSLWGEKKI 
FWGCGGETSF 
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K==Lysine, L=Leucine, M=Methionine, 
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T=Threonine, V=Valine, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10987 


24888 


A 


11080 


395 


2 


KR VP KKKPKKNS P P GGRKGGS P P * S P P F 
GGPDQP I F* VGGFDPPWLTPENPFFFKK 
KNN* PGKGGMPL I LKSLGGWGKKKI FTP 
GGGL P KN* KE T P P P P PGGKKK I P FP KKK 
KKKKKRAAARDLELVPTRP 


10988 


24889 


A 


11081 


24 


182 


TVS TTKCFSLLI P YYGRKDME * KI * I XQ 
KKKKKKKKKKKKKKKKKKKKKKKKKRGG 
PP 


10989 


24890 


A 


11082 


173 


3 


NNVI CYILYI PS VPKYKTTLNMRGSAFF 
FFFFFIF*DGVSLCHLGWSAVARSWLTA 


10990 


24891 


A 


11083 


284 


391 


SVAHACNPGTLGG*GEQITRGWEFETSL 
TNMEKARL 


10991 


24892 


A 


11084 


257 


3 


I KKLRE VNMLGWL Y YMREE ARP VD YVHW 
RGPE YS SLTKT IRNVIVKGHQHY* KASA 
GRGGSRL*SQHFGRPRRADYLKSGVRDQ 


10992 


24893 


A 


11085 


42 


171 


ARRMRAG I HRVS Q EGLDLIiTS * SAHLGL 
PKC WEDRRE PLRLA 


10993 


24894 


A 


11086 


176 


325 


FKRS KKKKKKKKKKKKKKKKKKKRGGAL 
LKKF* FKRSKKKKKKKKKKKKKKKKKKK 
RGGALLKKFLGGPQISGGGKRFFFFF 


10994 


24895 


A 


11087 


67 


379 


KKKKKRKKP KT KKRTGP VRQNVRWGR S L. 
SPGRAPAEGPQGGSTESPPLLVGRCLIL 
GLRE PGVPPAAYGRFLRD YMNS I QGKLE 
KQRPQ*GLPSLTGAAELIKH 


10995 


24896 


A 


11088 


128 


326 


KKKS GGGVFF F I IGGGGHFFGGGRFFFF 
FFLGGFFFKLFFFF*EKFFFLGGNLVKH 
SSKKSSWVGW 


10996 


24897 


A 


11089 


109 


2 


KTHSRPGTVAHTCNLSTLGGQGEWIT* G 
QEFETSL 


10997 


24898 


A 


11090 


52 


347 


DHS LLE LQMKL YAS AQAAI TKCHRMGGL 
NNTHFIiFIiFIiSLFFFFLKQSLAIiPPRLK 
GRG P I LVN* KL P LRGKRD SPASP PQ C VG 
ITGLAHPPRLILFF 


10998 


24899 


A 


11091 


386 


48 


IHLQPPLQICTLKFFTNKNPHLFLTGCP 
KYALI FLCFGALVKTGFHHVGQAGLQLV 
PLCPPRLGLPKCSD*RREPPHPAKKQFF 
IDRVMLCYPGWCPALGINPSSPVGLPKR 


10999 


24900 


A 


11092 


164 


3 


VYSLLLRCPFS * ALiTAKF INRCTYIHVC 
MYMCEPYAYTCLYTSTHIHVCMYIY 


11000 


24903 


A 


11093 


148 


325 


CLKLQ P FGLGWAHACNPS T YF I S YQ * L 
SPVI PALWEAKTGGS PE VRS S RPARP 


11001 


24902 


A 


11094 


162 


3 


LLGLKRPIFFFWGGKFFFFFFFFFEIES 
P S VAQ AGVQW * CNLGS LQ PAS GRV 


11002 


24903 


A 


11095 


259 


358 


LRPGTVADTCN* GQARWLI PVI PALWEA 
KAGGS LEVGS LRLAWP 


11003 


24904 


A 


11096 


254 


2 


S WG F I P ME ERERENR YWDRQQGHAI VSN 
IVINAFLYRYTEREKPGVSLSPRVECSG 
MIVAHCNLSLVGSSHSPAAAS*VAGTTG 


11004 


24905 


A 


11097 


132 


3 


KKKKRQ PRWAHACNPS TLGG CGGW IT* 
GQKFRTSL.VNMVKP 


11005 


24906 


A 


11098 


2 


455 


LNLEQLIEPLEVLWTAHLEALDPSSRTF 
VLLAFALL*LPWLQEAGAVQTVPIiSRLF 
DHAMLQAHRAHQLAIDT YQEFEET YI P * 
DQKYSFIiHDSQTYFCFSDSITTPYSMEE 
TQQKSNLELLRI YLLtilES WLE PVQFLM 
SMFGNNLEYDT 


11006 


24907 


A 


11099 


2 


375 


QLPEPLMDLWTGDIiVAMAPGSRTSLLIiA 
FVLLWLP * LQE AGAGQTVPVSRLFDHAM 
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XjQAHRAHHIiAIDTYQEFEETY I PKDQKY 
SLLHDSQTSF*FSDYIPTPCSMEETQQK 
SNVELLR I CLL V 


11007 


24908 


A 


11100 


1 


359 


LQLPKPLTVLGTAHLPAMCLAYATAL I L 
GCALLCLA* VE *ARAAQTVPQSNF*DHA 
MLQAHRAHQLAIDTYHEFEETYIPQDQK 
YSFLHDYQTSFCLSYSIATPSNMEETQQ 
KSNLELL 


11008 


24909 


A 


11101 


1 


370 


LPEPLTDLWTAHLVPMAPGSRTSLKLAF 
ALLCLPWLQQAGADQTVP * S WLFDRAML 
QAHRAHQLAIRTYQEFEETYIPKDQKDS 
FLHDSQTS FCF * DS I ATP SNMGETPQKS 
NLELLPISLLL 


11009 


24910 


A 


11102 


2 


361 


AHLLAMAPGYRTYLLLAFALLCLPWLQE 
AGAVQTVP * SRLFDHAMLQAHRAHQLAI 
DTYQEFEETYIPKDQKYSFLHDSQTSFC 
FSDS I TTPCNMEETQQKSNL * LIRI CLL 
LNESWMEP 


11010 


24911 


A 


11103 


8 


366 


PLRALWTAHLAARAPGSRTSLLLAIALL 
CLPWLQEAGA* QTVP I SRLLDHAMLQAH 
RAHQL P I DT YHE F E KT Y I P KDHKHS F LH 
DSQTSFCF*DSIATPSNMI)ETQQKSNLE 
LLRIFLL 


11011 


24912 


A 


11104 


3 


341 


E PLRDLRTAHLVAMAPGSRT YMLLAFAL 
LCLPWL*EAGAAQTAPLFRLFDHAMLQA 
HRAHQL YIDT YQEFEET YI PQDQ KYS FL 
HDSQTSFCFSDS ISTPCNMEETHQRSNL 
E 


11012 


24913 


A 


11105 


240 


329 


RNW PGAE AHT Y YP S T LGGRGRQ IT* GRG 
FE 


11013 


24914 


A 


11106 


2 


316 


HEERERERERERESAIVRQIHTQGEGLT 
SLVFPYRRPAPSGLKFTFVYVQEHILGV 
GGNGLS PLQ IGVI LRDGRGLAHI RCGTR 
NKILRILKSQGRAPDLS*DLY 


11014 


24915 


A 


11107 


169 


1 


EDGHHSGLHSKQQKQPAWPSMVTHTCNP 
NTLGGRGG WI T * GQE FETS LTNMEKPCL 


11015 


24916 


A 


11108 


192 


355 


GMSLTGHSNYI ILNLNFLWPCVVAHTCN 
PMTLGGQGGRFT*GQEFENNIiAYRGK 


11016 


24917 


A 


11109 


145 


383 


EVNKDKGKYHVSQGIQSFSYVAYVYFRK 
LI YTMMTI VNSALLYTELVI CQGRRVLV 
FDFGF* FWFFETESCS VTQAGEQ* CDLG 


11017 


24918 


A 


11110 


199 


400 


TRAGFFP P FKKFLKGPGPGAP P FNLNFL 
GGRGGGFF*IPKKKIYPLLLPFLKVQKT 
PSKPPFFFFFFFLRQVLLCCPGWSAMAR 
SRLTRA 


11018 


24919 


A 


11111 


373 


177 


QFETSLGTMSGPSFYLKN*KSARCGHTC 
MRFQLLQRQSREDHLSSGVHRCSNP*LH 
RCTTTWVTK 


11019 


24920 


A 


11112 


204 


399 


3SIHFWDWYHFSSPFFFFFFEKNLCSVPR 
AEDQGMNLS *LNPLPPGLKQFFA* PSQE 
AGIIGPGHH 


11020 


24921 


A 


11113 


192 


16 


KKTENNKR * RG YGE TE T PAH CGRAC KLG 
*PLWKPV*QFLKKLKMKLPYDPTIPLPA 
IFS 


11021 


24922 


A 


11114 


113 


369 


AGGQGPNLFLQKPGPPGLKGPPPLPLPG 
GGNYGGGPPSRG IF* FFEKKGFGGVPKV 
GPNLRG*RDLPAPPSKGAHPLGNGFFSR 
N | 



1218 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=IsoJeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProHne, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


11022 


24923 


A 


11115 


250 


387 


ILCDVEHSLWPGVEVHAYNPSTLGGQDG 
RIA*AQEFETSLNNIARL 


11023 


24924 


A 


11116 


192 


1 


VCDLF WKP S PHALTRRRTS VFHLKR * F S 
WLGAVAHTCNP S TLGG * GF WR I T * AQ E F 
ETSLGNMV 


11024 


24925 


A 


11117 


289 


202 


VFYCQ* KFIRKKKKKKKKKKKKKKKKKK 
KKLSKKIKKMDQQKLRSPTIK 


11025 


24926 


A 


11118 


412 


421 


RA*PTEQVTLGITEQSYSRGHINNRVHD 
LDVGS * HPDGAAAIKGS FVQRLKGY 


11026 


24927 


A 


11119 


16 


472 


DRLKPSATHAFADAYHIGKANP * PLTGA 
LLTLLMTSGLTM*VHLHSITLLILGLLT 
NTLT I YQRWRE VTRE ST YQGQHTP P VQK 
GLRYGI ILFITSEALFFAGFF* AFYHSS 
LAP T P Q LGGHW P P TGI T P LNP LED P LLN 
TSVLLASRVSIT 


11027 


24928 


A 


11120 


304 


346 


GLWHATGITP*PLTGSF*LLLMTSSLAM 
*FHYHSITLLILSLLSSTLTIYH*WREV 

trestyqghhtppvqkglrygi ilfits 

EAVFLARFC*AFYHSSLAPTPQL*GLWH 

atgitplnpl 


11028 


24929 


A 


11121 


174 


420 


FISALLKSAIQRNNAMFSTYYAIiTCTVG 
TAVSWS S VS * PQRNMMMS * VHAPAMAGR 
L VCH Y I R VYRR FEHVCKRAVHGQ YP 


11029 


24930 


A 


11122 


299 


466 . 


IFILHEICITFFFETRHFCH*AEVHWCS 
LGLLKPPPPGLRCDPSTSPSRVAGTRPS 


11030 


24931 


A 


11123 


138 


1 


NPFNEKNPPPKGPGPKNAFFFFF FEMES 
C S VAQARVQ * HD LG S LQ P 


11031 


24932 


A 


11124 


241 


499 


NQRERS *HLYLL YINTTVCLTHGLPIL 1 1 Y 
SAYQIAIYDLLLRRDAIARLQADLADRR 
RGLQPKKRFAFKTRRKDAASSTKVDAAP 
GI 


11032 


24933 


A 


11125 


252 


391 


PLSLDLQSNASLSHFTSPPLMFADR*LF 
STNHKNI *TLYLLFGAWA 


11033 


24934 


A 


11126 


45 


459 


AFLKNHPLET*SNHSFIDLPTLSNISA* 
GNFG* LLGACL I LQ I TTGLFLAMHYS PD 
AS TAFS S IAHI TRDVNYG * I IRYLHANG 
ASIFFI CLFLHSGRGLS YGAFL YS QT * N 
I GI I LLLATI ATAF IGYVLP * GQ I S F 


11034 


24935 


A 


11127 


109 


301 


QDSLMRKKSKNRSPSHER*RSKCLERKR 
S GD KKKKKGG P P KKK I GG FPFGGQMELK 
PFSILGGF 


11035 


24936 


A 


11128 


322 


2 


S CHHVS S LGDRARP CLQKE KKKERKKVG 
DFNTPLS IMDGYQHWYQARKI I IMDGTT 
K* KITRK* KIRNQ *DLTDVYRTLHPRVK 
YT FL S C AYGTF S RTHR I RGQ AI 


11036 


24937 


A 


11129 


182 


2 


FLPRGFQPPPVWPGPLGFLFPPPSSRD 
PGFPSMPFFFFFFFETESCSVS *AGEQW 
RNLG 


11037 


24938 


A 


11130 


138 


1 


GWVFFLWFSFSFLFFFLR*GLSLSPTL 
ECSGAI TAHCSLKL PGLS 


11038 


24939 


A 


11131 


304 


83 


WKKTD F PRKGRNS AS RL P SDSNCNS S LS 
LQGLSLLYRFWTYHAVTGKGSQSRPQEQ 
VLGYHARKNSR*VHRVK 


11039 


24940 


A 


11132 


247 


225 


KNFWGGPPLRGGKKNFFFFFGGGYKK*L 
LLCLVNMYI *CV* YI *HKRI ISDPKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
K 


11040 


24941 


A 


11133 


242 


3 


KKKKIFFPPRYFWAPPYFFFLGPPPFFF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, JL=Leucine, M=Methionine ? 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Un known, *=Stop cod on, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFVF* TQFHLS 
PRLECSGTISAHYNLHLPGPSDC 


11041 


24942 


A 


11134 


361 


2 


ASQSAGITGVSHRTRPTHGFKDT*RKPA 
EPEVTIH* 1*1 TLMSC3SriESPEKVCADI> 
I RGAKE KNLKVKR P AGT P I KAS R I T TR K 
TPCEGSNTRDRFQMRIHKWLTDLHSPFD 
GRVGGRV 


11042 


24943 


A 


11135 


1 


370 


IRQTFFQKREIVGIILCYWNVGYYSQLH 
*SSRMLWIFHFYRVYISVKVILVILVPH 


11043 


24944 


A 


11136 


273 


2 


GGRAGQQILFCRGIWLGPPRSCQCTFRS 
PPGLAL*PWECFSLPPALKIRFPSFLPF 
FFEMEFSRLSPRLECNGVTVAHCNLRLP 
GSSNFP 


11044 


24945 


A 


11137 


130 


354 


GQLNKLSGPYPENVGYTLP VL INPLAQP 
VIYSTIFAGTLITALSSH*FFT*VGLEI 
NMLAF I PVLTKKKKKGGPL 


11045 


24946 


A 


11138 


366 


2 


LVEMGFLRVRQNGLYLLTS * SARLGLSK 
CWDYRRE P PCPASDWVF I LTS PL IHALD 
GKEHTHTHTHTHTHTGLGI CQS SLGKQS 
GGWGWLS ANRGQ FS P FAVCLWS FLPE V 
PWTSALFT 


11046 


24947 


A 


11139 


228 


1 


CRAGRGL PAAVL 1 1 L FP Plili I PTSKYLI 
NNRL I TTQQ * L I KLTSKQM I AI PNTKGR 
T*SLILVSLIIFKATTNLLG 


11047 


24948 


A 


11140 


157 


337 


HIPTPPHTTCPERPS IRHNS I YYLKRYC 
LLR 1 1 LS LLPLQTS P * PLTGALSLLLMT 
SGLSM* SHLHS ITLLALRLLTYTLTI YQ 
* WRD VS RH S T YQRHHTP P VQKGLR YG X I 
VFTTSKDIVFSGLF*AFYHSRLAPTTQL 
*CHWPPSGITPLN 


11048 


24949 


A 


11141 


188 


3 


IPTPGGPPPLPPQKVLFKPGNPRAPPGV 
FFFFFETDSRSWRLECSGVI *AH*NLH 
FRGSR 


11049 


24950 


A 


11142 


224 


3 


SVSSPCISNTHKVDVRSSALLQESTSSR 
EVMTGHFHYLI I YET* LRSLAQS ARLAC 
SGT I SAHCNLRLPGS SN 


11050 


24951 


A 


11143 


262 


371 


S YCGPG WAHAYNP S TLGGRGRR I T* GQ 
EFETSLTN 


11051 


24952 


A 


11144 


244 


379 


TIGKQRCHAGIEEVSDSSQAPWLKPVIP 
AL*EAEAGGSLEPRSLR 


11052 


24953 


A 


11145 


1397 


1565 


DRLESLLEMHI PGVYPNQWNTNFYLF I Y 
FEAESHSVAQTGLQ*RHLGSLQLPPPQV 


11053 


24954 


A 


11146 


284 


1 


GRGRQGGDS SGKGEGNRAMD I TGGLVGK 
GQGAGE AFVEVWE KE LRG F KVE GEGDMD 
KERWD * GL * GGRE RERERE RE RERERE 
RERERERESLV 


11054 


24955 


A 


11147 


64 


286 


THALAWTRZSTQR I G S GHGAMCL YS HLLRK 
LRQNDHLTPGV*KCLEL*LHH*TPTWTT 
KPTAKAIFFNLKKKDSWD 


11055 


24956 


A 


11148 


230 


358 


VSVS LI I FIATTNLLGLLP YS FTPTTQL 
S INLSMAI PL* AGAV 


11056 


24957 


A 


11149 


81 


367 


QTLRTKMNKNWASLIDPTILGLPAAEL 
IILFPPLLIPSCKDVINNRLMTTQQ*LI 
ELTWEQMITIHNTKGRT*CLILECLIMV 
IATTKLLGLLP 


11057 


24958 


A 


11150 


257 


2 


SVCAVITKYLRLGCFADAADTQRPLPPA 
T VKLTMF FN I ATHGE P LGCI F * L F VDKV 
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0=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PKTAENFHALSTGEK* FSYKGSCFHRIL 
P 


11058 


24959 


A 


11151 


257 


1 


WRCGE KETL IRC W WE CKLVQ S LGRTVWR 
FVRELKLELPFNPAIPLLGIHPKEKK*L 
DPCTCMFIVALVTIAKMWNQPKCPSMDL 
V 


11059 


24960 


A 


11152 


17 


367 


NHKDGKKTEEKKQSTS PPQKDCS S SPAM 
EQS WTENDFDELREEGFRRSNYELQEE I 
QTKGKE VKNFE KN * DECITR I TNTENCL 
KELMELKAKARELCEECRSLSSRCDQLE 
ERVSA 


11060 


24961 


A 


11153 


213 


2 


TFCHFFFFFETESGSVAQPGVRWCDLGS 
LQAPLPGFTPFSWRQRLQ*AEIVPLHSR 
LKLHFKKKAGVFSC 


11061 


24962 


A 


11154 


1 


347 


GTSNHFPFIiYTYQVHNSTDYPRNRCCLN 
PSLRFHTSSKPLPARPHIMTHQSHAYHI 
VKPS P * PLTGALLALLMTSGLAM * FHFH 
SITLLILCLLTNTLS I YQ * WRD VTRE S T 
YQG 


11062 


24963 


A 


11155 


223 


3 


YSTNLKILRNHFILFF*DRVLLCLPGWS 
AVASSRLTANSASCLSLPSSWDYRRAPP 
HPANFCTDRFHHVGQVG 


11063 


24964 


A 


11156 


143 


1 


RGFLNQCPSQNFPFFFFETESCSVTQAG 
MKWYDLS*MQPPPPRFKLV 


11064 


24965 


A 


11157 


149 


355 


LQPLPPGFKRFSCLRLLTSWDYRKDVDS 
ALLSNYVT*KYISKTLCRHQSPEIVREF 
LTAMKSHKLTKVG 


11065 


24966 


A 


11158 


156 


370 


RSDGCQKIPRNMLGGWSVLHHTVDEQ*G 
PNRY I KKCSASL I IREMHI INTVRRCLT 

PATMAT VRRKTTS VG 


11066 


24967 


A 


11159 


298 


4 


HFQWIMWRQLVKGRLQAQKNAAFTDSH 
THLC YLKCVYFCVCVCMCI CVF VCV* SR 
AL* *SLYLFLCI KAVKAGHSVSGLESQH 
YGRLRQADHLRSGV 


11067 


24968 


A 


11160 


364 


285 


NSWGKIKILKGK*PPPPKKIFFPKYPQS 
FFLSPP * KKKTNFPHPRKYLS P PGI FLK 
APPPXiIFFFFFFFFFFFFFFFFFWG 


11068 


24969 


A 


11161 


198 


406 


KS VYMTLKKKRGS FHERGDVPAI CNFFF 
* IHIYMCVYTHICVYIYIYTHI YI YTHT 
QXYIYIYIYIYIY 


11069 


24970 


A 


11162 


95 


376 


KIITKHNIARTNPYTFCIMN*LEITLQG 














KKKKKKKKKKKKKKRGGGGFKKKPGGGP 
KIRGGE KI FFFFLGGD 


11070 


24971 


A 


11163 


343 


391 


WR * LS S TNHND ILTLYLLFRARAGVLRS 
AVSLLIIiAEQGQPGNXiLGDDHIYSAIDP 
AHAFAI MFF I VI PM I IGGFDN * LLAVRN 
GAPEMAI PH INN ISC* RLPPSLLLLLAS 
AIVEAGAGTG*TAYPPLE 


11071 


24972 


A 


11164 


1 


158 


QPMTPNRGPLSP* PLTGALSALLKTSGL 
AM * FHFHS I TLL I LGIiLTNTLT I YQ * WR 
DVTRES 


11072 


24973 


A 


11165 


265 


466 


KLLTSLSFFNDLRETNQVGLLELSQAKY 
KKT PRLGT VAHVCNP S TLGGQGGW I T * A 
QEFETSLANMA 


11073 


24974 


A 


11166 


204 


458 


KRCLLPLPPDSNTQSSQCRREQIGRAIR 
KILTFETQNIKRLKTEKQLLKLLE *SGK 
ITAHCNLDLPGSS S PPTSASQS AG I TAG 



1221 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ingto first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=l/ysine, LHLeucine, MHMethionine, 
N=Asparagine, P=Proline, 
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nucleotide deletion, \=possible 
nucleotide insertion 














s 


11074 


24975 


A 


11167 


256 


3 


KILFFKPGVLSPPWGGVNFFQLMPFQGG 
PPKKKRGPLSPPKKGPFFFFFFFF*ESR 
SLLPRLECSGAISTHCNLCRLGSSRTRG 


11075 


24976 


A 


11168 


1 


457 


HRTPSVRTPNCRGDPRVRSTNHKDIGTL 
YLL FGA* AGVLS TALTLL I RAE LGQPGN 
L LGNDH I YNVI GTAHAFV J IFFIVIPII 
I GGFGN* LVPL 1 1 GAPDMAFPR INNI S C 
*LLPPSLLLLRASAIVEAGAGTG*TVYP 
P LiAGN Y S H P GAS 


11076 


24977 


A 


11169 


343 


428 


GR* LFS TNHKD IGLL YLLFGA* AGGLST 
ALSLL IRAELGQPGNLLGNDH I YNVI GT 
GHAFVIIFFIEIPIIIGGFGN*LVPLII 
GAPDMAF PR INNI S F * LLP T S LLLLLAS 
G I GEAGAGTG* TVYP PLAGN YSHPGAYV 
DLT 


11077 


24978 


A 


11170 


91 


2 


PRQDPSFFFFFF*TESGSVAQAGVQWCD 
L 


11078 


24979 


A 


11171 


219 


1 


IFKSKILCLPPPEPNFFFFFFFFFFFFF 
FFMRHGGACCSPSYLGG*GERMT*ASRV 
AGTTGVCHYAWLI FVFL 


11079 


24980 


A 


11172 


292 


365 


RI *SLGQAQWLTPVI PTLWEAKVG 


11080 


24981 


A 


11173 


251 


1 


I CDLQ I LTHGITTC VKMGLTTL FVSGI T 
FLIHHSLAI KFGV* KNPS *LGT VVHTCN 
LSS*GGRGGRIA*AREFETCLGNIVRP 


11081 


24982 


A 


11174 


200 


407 


KI FETDVYVKCSNILCLWAS I SFSNYFG 
GPFYLNL* * FLSYFSFLETESHIVAPLE 
CSGMILAHGNLCL 


11082 


24983 


A 


11175 


11 


155 


STNFFLFSETGSGSVLQAGVQWPGLDLL 
AS SHS PAS AS * VSGTT VDAA 


11083 


24984 


A 


11176 


140 


1 


PSTFPDWAMRRVSCLSKGDKGQV*WLT 
P VIPAI WEAKAGGS PES I 


11084 


24985 


A 


11177 


210 


1 


TDEELLLMDEQRK* FIEMESTPDEDAVN 
IVEMPMKDDEYYINWD*WAGLKRTDC 
SFERSSTVGKMLSN 


11085 


24986 


A 


11178 


133 


148 


STSTR*LFSTSRKDIGALHLLFGA*AGV 
LGTAVS LL I RAWLAQ LG SNKE A 


11086 


24987 


A 


11179 


236 


1 


KFWKKDPLTKGGCPLKKKGGGGIFSKKK 
KNVFFPKKKFKGGKKKNFFFFKKKKKKK 
KKKKKKNKE * LF I S FAH IAVHL 


11087 


24988 


A 


11180 


210 


1 


QS LGQENLQVFFPLFGP * PN YLDKGEKW 
I R VG K I WGHL FF FFE TE S RS VAQ S PRLE 
CSGAIAAHCNLCLP 


11088 


24989 


A 


11181 


292 ! 


348 


KKGALFFSPGG*KKNFPPGGKKKGPLFS 
KKKKNPP * KKPGE KE KTRFP P FLGGLGG 
GFPLAPGGKPPRNRVLAPPPNPGGKKKK 
KTPFSQKKKKKKKKP 


11089 


24990 


A 


11182 


168 


3 


KPEASMLRMDFFFF*EAGSHSVAHAGVQ 
WYYCSSLQPPTPELSDPLAPSSKVGGIT 


11090 


24991 


A 


11183 


87 


351 


KG I KWS LN I QRGVQ PHLR S YN YS F F S F F 
S*DWVLLRLECSGTTTAHRSLNLPGSSD 
PPTS ASQAAGTTGMHCHTRLI F * ETGFH 
HAVQ 


11091 


24992 


A 


11184 


55 


344 


TSLNLCFFFFFWGEKSPFLPPGGDEGPL 
FGQLDTRPWGIEKIFFAPPP*KGKRGVP 
PGGEEFKNPPAPKGEPPLS*KK*NQPGQ 
GAPPCYPPFLGGGAKKIFSIPQGRVSN* 
P KRGP S S P PGGKKGD F S P QKKKKKQRFR 
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W=Tryptophan, Y=Ty rosine, 
X— Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EV**PNSPSGAP 


11092 


24993 


A 


11185 


344 


190 


LVETRFHRVS QDGLNHLLTL * S TRLGLP 
KCWDYRCEPPRPANTANSLNFPM 


11093 


24994 


A 


11186 


135 


3 


IFPKVFGGPFFFFFFF*ETESRSVAQVG 
VQWCNLGSLQAPPPGFT 


11094 


24995 


A 


11187 


305 


14 


I ML ITS IRRS RLS S TAEKGTLS RLR I CG 
RVAKYL INNRL I TTRE * L I KLTSKQM I T 
ILNTKGRT*SLILVSLSWYGYHLF*G* 
FD*SLLLY 


11095 


24996 


A 


11188 


198 


1 


LE Q TFNMMF P PNKS 1 1 S I P S FGFLFK I S 
FPFLIiFFF*DEVFLLLSRLECNGTIIiAH 
CNLCLPGSSD 


11096 


24997 


A 


11189 


352 


3 


QLCNTPLPKTNHAMLTSSNANTLIYRRS 
AIAAASSPPILFPASKILLNSRLITPHQ 
* L I KLAS KQM I AI PNTKGRT * S L ILVS L 
IICFEVSDISHCWWISRLLMRYLEVGI 
NRGG 


11097 


24998 


A " 


11190 


270 


137 


CIINRLYVCVYIYIYIYIYI* IYFYMYV 
YNILCXLGHRRESGVI 


11098 


24999 


A 


11191 


18 


342 


ACRKRKMSSKFALRATGNC YYS CTFCS I 
SFPHPK*MPCYQKKKKKKKKKKKKKKKK 
KITV* K* SKTK30TIKSLGAR*ELSLA* L 
LGE P P I FAKGHPLMDKWVE I CKRP 


11099 


25000 


A 


11192 


169 


3 


ERSGI IRVYQD*NQCLAWSLtA* KIIHPW 
LAAVAHT CNP S P S GGRHG W IT* GQE F K 


11100 


25001 


A 


11193 


407 


1 


KRAC FR WLTLL I S TL WEAKQE1STCLNAG V 
HDQ PGQHGETP S PQKI KKS AWHGGPRL * 
EVRWED* LSQGG * GYSEPCSHHCTPAWA 
TERDPV*KQKRS I PWKELASERKDSGRC 
RWLVREMWPSCQASVIQVRRI SF 


11101 


25002 


A 


11194 


110 


412 


VCGFRLLHSPAILMPLLMPSYFRRHSNT 
E IKP ISNPTVAS KRASERKS CTSLTLNQ 
KLEMIKLSEEAT* KAEVGQKLGLLHQKV 
SQWNAKEEFLKEIESA 


11102 


25003 


A 


11195 


34 


155 


AWHEGMHHVS QDGLNLLTL * DACLGL P K 
SWDYRRKPPCPA 


11103 


25004 


A 


11196 


289 


453 


NQPGKS FMISST FHFMME FTLVHAGV * W 
LDLGLLQPPPPRFKGFSCLHLPSGWDR 


11104 


25005 


A 


11197 


177 


432 


KWE KQ AQ TLGGGNF P KG P KTR VG KNFE K 
LKGNPPFLCL*KRGRKGDTGPWKKPAPS 
GERGIPGKKGGKTGPPGFKGPMEKIKRG 

K 


11105 


25006 


A 


11198 


139 


3 


AMVS LS PRLECS SM IMAHCS LGS LGS S D 
P PTS AP * VAR I TGMHHP 


11106 


25007 


A 


11199 


150 


492 


GDPLPGHHPV*EVRRPFARQLPRLRSEE 
PLR PVAT P S GK * GAS LSGS HP I QEGGGG 
GQPLPGHLPHPGWRSGVKPPPGQSARPG 
GEGHLLPATPTGK*GGPQSGQPHLSGGR 
RE 


11107 


25008 


A 


11200 


239 


64 


YQKYYYYYFF*ETESRSVARVKSSGLIS 
AHCKLHLPGSRLQLCLFPTSAVFGQESH 
IT 


11108 


25009 


A 


11201 


31 


396 


I L.TMRE I VH I QAGQ CGNQ I GAKF WE V J T 
DEHGMDPTGTYHGDSDLQLDRISVYYNE 
ATGGKYVPRAILVDL* PGTMDSVRSGPF 
GQIFRPDNFVFGQSGAGNNWAKCHYTKG 
AELGDSVLDV 


11109 


25010 


A 


11202 


85 


407 


VLLPHSEALEGADTMPHSYPALSAEQKK 



1223 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D—Aspartic Acid, 
E=Glutamic Acid, FHPhenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine ? P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyr osine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ELSDIALQIVSPGKGILAADECVGSMAK 
RLNQIGVENTQENRRLYRQVLFNADDRA 
KKRIGGVIFFHETL*QKDDDGVT 


lino 


25011 


A 


11203 


302 


115 


THLDHVAIYLSIYLSIYLSIYHLSIYHL 
SIYLCRF*LID*DRVSLYFSGWLQMPGL 
KPSSCF 


11111 


25012 


A 


11204 


130 


348 


GNAYGGPCSVLVQRT* * IPALTFPSTCL 
DSKFQRDLVTLRTDFSNFTSNTVAEIQA 
LTSQGESGLDGALGWGQ 


11112 


25013 


A 


11205 


232 


62 


I S ARNEGFNTLN * RTFFFFF FS SHCLSR 
LECGGMILAHCNXiQLPGESLEPGRQSLQ 


11113 


25014 


A 


11206 


161 


2 


KKKGFLLiPL I E P F C IQE I QLRLGAVAHT 
YNPSTLGGRGGWIT*GQEFKTSLAINr 


11114 


25015 


A 


11207 


251 


3 


PNHLPKAPSPNTTLRIRFQHLN-FRGAQQ 
TFGLQHKLTSKQMITIHNTKGRT*SLIL 
VSLIIFIATTNLLGLLPHSFTPTTQL 


11115 


25016 


A 


11208 


145 


287 


GFRLKL * INQKYS FLKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKK 


11116 


25017 


A 


11209 


436 


no 


ISGRGEQKFKYFSPAPFFRGINPPPLGG 
E *NQNLKPP * KKI FLGS PKVP KKKGGP Q 
KILKKGPGGGGPPLFPPPLGGQGGGFPF 
SQGFKTP PGKKKKP P PKKKKKKKK 


11117 


25018 


A 


11210 


217 


2 


DKWSVLHINYNILS SSLVHVLFKYLEN* 
K* KKNLW PGMVAHACNTNT LGGQGGR I V 
*AQEFETSMGNTVKC 


11118 


25019 


A 


11211 


146 


3 


QKVSHAQYRCLS WG YFF F I LFLETRSRS 
I AQAG VQWRDLGSL * PP PA 


11119 


25020 


A 


11212 


187 


384 


GLICFYIiTCFYFIFYFILFYFSLLCYVM 
LCYVLCYVMLCYVMVCYVMLCYVI*DRV 
SFCHPDWSAA 


11120 


25021 


A 


11213 


373 


482 


MI KLRHRLGTVAHTCNPS TLGGRGQW I A 
*GQEYKTS 


11121 


25022 


A 


11214 


64 


362 


I TG VS HFTQPLPFI GGLAL S P KLE CRGM 
IIAYCNFELPGSGDSCHLSLPSSWNYRY 
MPCMPS *LKKNF* * RQGL T VL S RLE KD Y 
LL*DFSALRSFPWGR 


11122 


25023 


A 


11215 


181 


368 


CDS AVPLLD I YPTE FKADLKK IRI PMF I 
I ALFAVAKR * KQPT YPS IDKWI KKLWAG 
CGGSRL 


11123 


25024 


A 


11216 


268 


369 


TTRPDMWHT YNS S TSGGQGGT I T* AQQ 
FKTSLT 


11124 


25025 


A 


11217 


90 


3 


I GQAQ * LTP 1 1 PALWEPKSGGS PEVRS L 


11125 


25026 


A 


11219 


277 


272 j 


S LTAPGL * S QLLRRLRWSDHLS LGQS S F 
S EP * S CP CTPAWVTQ * DSVSKKKEKKKE 
KKRKEKEKVNQTPKATES PGQAL IGSHS 
LQVQKRVYAHPIL 


11126 


25027 


A 


11220 


19 


343 


LELEVEPEDVSELLQSLDKNYLSCFLKR 
FLEMKYTPGEDWNTIKMTTEDLKYNTD 
LVDKAAAGLERINFNFEGSSSVGKMLSN 
S IAWYSE I FYERQSQFMWQT*LLA 


11127 


25028 


A 


11221 


266 


356 


FFEMESHTVA*AGVQWCDLGLLQSLPPG 
FK 


11128 


25029 


A 


11222 


105 


2 


RF VC ST I KVLRDL S S ARSNPGRF I * T SN 
SRPRKS 


11129 


25030 


A 


11223 


145 


350 


RKGRQLLD * DLGW V * WLTP 1 1 P I LWEAE 
LGGSLEARSSRPAW 


11130 


25031 


A 


11225 


155 


348 


PLFFFCFHRNTFY* KSVCRFPKKKKKKK 
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Amino acid sequence (A=AIanine 
C=Cysteme, D—Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lrysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R=Arginine, S=Serine, 
T-Threonine, V=Va!ine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKKKKKKKMEGGGGGANTKFLY 
GENRKPPLFFFF 


13131 


25032 


A 


11226 


273 


3 


NFLS VFNFTR I CLGMCFLS I FLL I FGGS 
LKYQLFLAQGFFFFLHYFFTSSIHSSLL 
Y*MLDFLDLFFLYLFFCNFNFSVFLFCI 
liSKFF 


11132 


25033 


A 


11227 


3 


413 


S PAE PHRS YTMWNVKVGVNGFGR I ERL V 
TRAAFNS GKVD I VA I NDP F I DLN YM V YM 
FQYDS THGKFHGT VKAENAKLVINGNP I 
TIFHERDS CKI KRGDAGAE YW* STGVF 
TTMDNAGAHLHG * ATRGI I SAP SAD 


11133 


25034 


A 


11228 


48 


413 


VFVLVSCGFFFFFFWEKKICFFWPGGKK 
RGK I GGTGTFG P KGKGNFRG P P P KNKE K 
GGPPPPPNNFGFLKKKKFYQGGQGG*KT 
RNFGKPPAPNPPKGGDLGGGPPTQGQNG 
FLKKKKKQKK 


11134 


25035 


A 


11229 


172 


404 


E FENS CS PGWQRRG VM I PG VT VEDMNQ Q 
EFIRALVALLKKSRKLKVPEWDTTKLA 
KRKELTL * DEKWF YT * AAS TT 


11135 


25036 


A 


11230 


247 


2 


QPKKKSVSKKKKKELPCDPAIPLPGIYP 
KEKKS VYKKDNLHLRVYCRAVFT IAKIW 
NQP KG * SGDK* I KKI * YI YTVDLRR 


11136 


25037 


A 


11231 


1 


372 


TETTVEVAWCELQDRKLTKSERQRFKEE^ 
AEMLKGLQHPNITOFYDSWESTVKGKKc' 
IVLVTELMTSGTLKT* VHQYYKS * PRSM 
REFD FLKf FRS TS FTL WQPKKGTVLPTV 
TVLAYFLAHKVL 


11137 


25038 


A 


11232 


311 


464 


KKATPRNLCKWPGTVAHACNPSTLEGRD 
GW IT * GQE FETS LTNMVPP 


11138 


25039 


A 


11233 


64 


418 


VP VASRRRRRCGRVGGGKAMADLDKLN I 
DSIIQRLLEA*GAKPGKKVQLHENEITG 
L GLEPRE I FLRQP ILL* LVAPLKI CGD I 
HGPYYDLLRLFEYRGLPPKSNYLFLEDY 
VDRRQH 


11139 


25040 


A 


11234 


233 


421 


PTTSNMRSRLLRGAS S * KKTAG P QQRNL 
E P ALPRRWGGRS ADNP P S GS LRKSGKNM 
QKTPGTA 


11140 


25041 


A 


11235 


343 


1 


NNTHGLSHS S VGRNPGAVGSGS CMSGIK 
LSEGLRSLLEAVCISRAAVIKCHSWVAS 
NNRHFFSPSSGG*KSEAKVWAGPCSL*R 
RVPSLPLPASGGSRHCLVCGCVTPFSAF 
TO 


11141 


25042 


A 


11236 


122 


419 


RT PRGP KPNPGL QR KT F FL I GGR I KT YL 
GIFGKDLYFWGGKNWDTLLPKI *RSREN 
KNF*REKWVNLPPYPGCLKNGARQQHFA 
HLLILRDTKTYMLAF 


11142 


25043 


A 


11237 


1 


379 


AFNHLHAGHGLSGAAMKSLVLLLCLAQL 
WGCHS S PHGP WL I YRQPNCDDP * TEE AS 
LEAIDYINQNLPWGYKHTLNHIDEVKVW 
PQ*PSGELFDIEIDTLETTCHVLDPTPV 
A* CNVRQLNEHAVQ 


11143 


25044 


A 


11238 


1 


397 


QTMTPNRGPLCPSKDLRSSHVISLPLHS 
ATHTRPTNQHTNHI PMMARRSTRKHIRR 
APHTTCPKRPSIRDNPIYYLRNFFLRRI 
FLSLLPIiQPSPYPPIRRALAPNRHHPAK 
SPRRPTP * PLTGALS ALLKTSGLAM* FH 
FHS I ALL I LGLLTNTLT I YQ * WRDVALE 
S T YEGHHT P P VQKGLR YG IILFITSEIF 
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Q=GJu tannine, R=Arginine, S=Serine, 
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nucleotide deletion, \=possibIe 
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FFAGFF*AFYHSSLAPTPQLGGHWPPTG 














ITPLNPLEDPLLNTSVLLASGVSIT 


11144 


25045 


A 


11239 


267 


3 


FVSVRITPIYASAATWLSYVWVWASSPN 
VCPPGDRVALSPRVECSGTISAVRNLRL 
PGSSDYPASA*PADGWAPPQYRCPPPRL 
ANF 


11145 


25046 


A 


11240 


391 


31 


VFFFFFFFFFFFFFFFFFFFFFFFPPPF 
LKNPFEPPPNFFFFCPKF * ILFLN* I * F 
*IKF* I *KIPGFFLNPPPFFPPFFFFPP 
GALFFFFFF*EGVLPCSPGWSAMSHLLS 
EKRKLTPRMSYN 


11146 


25047 


A 


11241 


371 


491 


KI FLCWARHDAHACNPS I FRGQGGH IT* 
GQEFKTSLANMV " 


11147 


25048 


A 


11242 


1 


221 


HRIDPLCRNSARAFFFLSTGCHRVSQDG 
LDLLTS * S AHLGLP KCWD YRCKP PRPAL 
NCF FRT I S QLLNLC I KD 


11148 


25049 


A 


11243 


382 


333 


TYKIKDFKTF*RWFHPNISRVEAEKLFL 
SRGQRGDFLARPSESSPGGFTLSVR 


11149 


25050 


A 


11244 


169 


477 


TQKDEKESTTTTKNAENSKGQSAYSSPS 
DCNTS IARAQN* EEAEMDE LTE VGFRRR 
VIMNFVKLKGHILTQCREAENHDKTLQK 
LLTSITSLEWNINDLMELK 


11150 


25051 


A 


11245 


200 


462 


S LIYNELCTIHILYQYYVA* QRS * CFQQ 
LMSFY*RTLKNLKESLCKVALLYPSFLK 
WKRGWLGTVADT*KPSTLGGQGGRIA*A 
QEF 


11151 


25052 


A 


11246 


484 


301 


I FCEKKGL* FQLLRRVRWEGPFKTPGNQG 
CS*LLSSPCPPGWITE*DPVSQKKBCERK 
RNQVMELVSAQENKNKPRSVGLQACARS 
MLAFSTTPR 


11152 


25053 


A 


11247 


226 


458 


NTVAVDWGPRCFFSLLLSHAENCDMWFT 
KI ELF I FTNHLLLKQDLEMLW * GQAHAC 
NPSTLGGRGG WI T * GQEFETS 


11153 


25054 


A 


11248 


304 


436 


SPSRCRK*KIG*AQWLTPVISTLWEAKV 
GRSLDPRNSRSAWTMY 


11154 


25055 


A 


11249 


216 


451 


THKYLFYYLGYNPMLYI ERETGS CS I A* 
AGVQWYSHGLLHP* PSGLKRSSQVAGVI 
CIjANF FF FFIiKKWS LWLPRLG 


11155 


25056 


A 


11250 


260 


474 


DKYFLLYPLKALNWLGAHINI * KVSKI 
MC * K* DKNMYNFFFS F * DGVS LLLPRLE 
CNGTILAHCNLCLPC 


11156 


25057 


A 


11252 


193 


496 


MTSCYSFYL* IKDASMVKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKASSSQKAQDYS 
KASDYK 


11157 


25058 


A 


11253 


348 


478 


DLGWLKLGTVAHTYNSSTSGGQGRRIT* 
GQKFKTS LANWKPC 


11158 


25059 


A 


11254 


174 


475 


MHHW VL I LQD I DDNT * NCNLH FS L C S QL 
KSIQKEKKKKKKKKKKKKKKKKKKKST 


11159 


25060 


A 


11255 


292 


48 


L PLT CHRPAVSHTHLAFSNP TAGITIF * 
TRSSSWNAGTFIIFK'IVTFFLFFFF*ER 
VS LKLTRLERSGAI PAHCNLHL PE F 


11160 


25061 


A 


11256 


347 


16 


TQSADWCIYNPPARHKSSPSPHPTQ*PS 
WLHLVDPAPGPWVELPTSPAPSACTPQP 
LGSRWDQPPQSKGQCPSGRLGPQGSPPQ 
VGLGKAGWRSQAL PVGLTRS I E SHHW 


11161 


25062 


A 


11257 


238 


590 


APGAYIFPQQHNPTDSQVIFSLTASHLL 
I CS PLC I S VLHFNGMNIDDENRNAHVHA 
RACAHAHTHTHTHTHTHTL YLMNYRE TL 
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Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=IsoIeucine, 
K=0Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ML L KT RGRATG S HARC I FS * SLLQACP 


11162 


25063 


A 


11258 


310 


461 


LSFGDVLQLESDRPGAVARTCGPSTLGG 
*GGWIT*GREFETSLANML,KPR 


11163 


25064 


A 


11259 


461 


174 


MG FHR VS QDGLDL LT * S S AHLGL PKC WD 
Y*CEPLRLAPVGKLFKCVGYKLVLCHGS 
GYMSMVIKIRMPGLIWSLEGGSAAWHR 
YWSIWEISFGC 


11164 


25065 


A 


11260 


343 


472 


IjLRRLRQKDCLSSGGQGCSEPRLDYCTP 
AWVTK*DPVSHKQE 


11165 


25066 


A 


11261 


229 


3 


VKMKLQ* VP VKVKI * KYTPLLGI YLKEL 
KAGFGRNTCTLMFIAALFTIANRWMQPR 
LP*IGEGVNNM*YIHTMES 


11166 


25067 


A 


11263 


163 


399 


KLLPGTRNLHAMEHPIFGWIiRNPHATAQ 
G*HPLSQSSLiAIiHGRADHICYPELCTSS 
S S C 1 1 AGYPN * E GMFAS QHHRVH 


11167 


25068 


A 


11264 


316 


427 


TNKKHI 1*1 INTTTHSLI I S 1 1 PLLFFNQ 
INNNLFSCS 


11168 


25069 


A 


11265 


184 


418 


KTPPGFFEKNPFFWGGKKGPPPPPKNLP 
LGGKKKFLRGKGGKTFFFFFF* KIGSSS 
SPPPPPGGEQAPPPPPKSPPRR 


11169 


25070 


A 


11266 


219 


13 


KFFFSLEHQNFGGGGVPLFPPPKEGFLP 
KKPKQGFITPPLKQKIKPPPPGGYSGPP 
RVL*KGPPHIFYK 


11170 


25071 


A 


11267 


3 


173 


SHHARPETGFHRVSQDGLDLLTS *SACL 
GLP KCWD YRRE P PRQARKL YCLGMRTNE 


11171 


25072 


A 


11268 


157 


2 


PVD I THVKTT VGLGMVAHACMP S TLGGR 
GGRIT*ARGSRFLLPPHWTSELR 


11172 


25073 


A 


11269 


291 


468 


ISEGKRSL.FVILHFILFFETGSCSVAQA 
GVQWCDLDSL * PHIPGPK* S SHLGLHTG 
ACH 


11173 


25074 


A 


11270 


29 


478 


GWN P YNQ QD I GT L YL * S GARAG VLGTAJL 
CLL I RAELGQPGDL.LGNDH I YNVIVTAH 
AFAIIFFIVTPIIIGGSGN*LVPLIIGA 
PDMAVPRINN I SF* LLPTS LLLLLASAI 
AEAGA* TG* T VYPPLAGN YSHPGACVDL 
TIFSLHLAGV 


11174 


25075 


A 


11271 


61 


474 1 


P YNQGGNDAHAAWPHLTKWRSQGTTE I C 
REPEGAWQRPSQPRLHKGRSPGQLRALS 
QNLPSYSGGSHLLSAYYVPDTISKHFIN 
TIGWLGTVAHACNPSTL*GEDRWIY*AQ 
EFKTSLANVL 


11175 


25076 


A 


11272 


277 


478 


KIYQPVKIHGPRPGAVAHTCNPSTLGDR 
DGWIT*GQEFETSLANMNSPSL 


11176 


25077 


A 


11273 


350 


11 


KEKTAGQARWLTPVI PALWEAEAGASPE 
VRSSRPA*PPP 


11177 


25078 


A 


11274 


138 


3 


NY I NL YAW LGAVAHACNP S TLGGRGGW I 
A* AQ E F KTS LGNMAKP 


11178 


25079 


A 


11275 


359 


219 


FVIKMGFPQVGRDGFNFLAS *SAPLGLP 
KWWDYRH * PPRLALFFFF 


11179 


25080 


A 


11276 


262 


3 


FCMSLVHLFSHSQDNYFTSFSL* IFNIP 
SGPFLFAFQDTLQ* P *DF I YFYFETES P 
S VTRLECSGAI SAHCNLRLLVS SNSPAS 
AS 


11180 


25081 


A 


11277 


285 


1 


VKKGDSVS KI FRKKKKRKDKIRLHFM I T 
TVNNLG I EGNLLNM I KE I YKNPTANI IL 
NGERLKAFSL*SGTRMSFLTPTFF*ILV 
DTQQVYI FMSH 


11181 


25082 


A 


11278 


160 


3 


TDKEVPLINAQRKWFLEMES IPGEDAMN 
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Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutainic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S-Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \-possible 
nucleotide insertion 














IV*MTTKELQYYINLVEKVIARPE 


11182 


25083 


A 


11279 


334 


488 


PLNTCGY*NLN* *NFQPGSVAPACNPST 
LGGQDRWIT*GQEFETSLANMVK 


11183 


25084 


A 


11280 


527 


356 


F F VFL VET AFHH VAQD S LDLLTS * S S RL 
NLPKCWDYRHEPQRQAKNVGEFSETRSI 
Q 


11184 


25085 


A 


11281 


486 


248 


VRQh FS LliBPRLE CNG VI S AHCNLRLPG 
S CDS S ASAS * VARITGASGS QAWLQVQ 
CLQPVQPGELLRVDLFQLWLQR 


11185 


25086 


A 


11282 


38 


157 


STGLHHVSQDGVDLLTS*STLPKCWDYR 
HDPLRPASQRL 


11186 


25087 


A 


11283 


474 


339 


FVFLVQMGFHHVGQAGLELLTS*SACLG 
LPKCWD*RHEPTDPAK 


11187 


25088 


A 


11284 


2 


462 


LETTRTARWNS VIALRHHNS CPYKMAKTL 
IDKCVCSDYSKDWPFPIFLPLLRPPYCL 
RHNNMETRSINNPPWPSKCSSEKTNEEG 
MLKVE I G* KLCLL CQT FS KDMNAEKKFL 
KEIKSATPMNTSMIRKQNRLIHDMEKVL 
VIRIEYQASHNIL 


11188 


25089 


A 


11285 


65 


280 


SRAVEFNLLTT* SACLGLPKCWDYRCEP 
PHPASP*FSKLLSSNLKTYLSSL*DSHS 
GF YS FVCSLGL 1 1 LI 


11189 


25090 


A 


11286 


244 


488 


MRLGRVPSVLFCYVRMQREGI ILEAENK 
PSPDIE SVGAL I LDFTASRT I RNKI LLF 
IIFPG*GILDSPNVP 


11190 


25091 


A 


11287 


345 


445 


KR P GT VAHACN PHTLGG * GRW I T * GQE F 
WTILA 


11191 


25092 


A 


11288 


223 


500 


SPGTRPKVLECQSTLGRQAILLLGEGWL 
WGQSHRLGVQSQLCLCVLTOLTCSLSCS 
FQI IKLR*RPGVMVHTCNPSTLGGCGGR 
IAQGQEFK 


11192 


25093 


A 


11289 


157 


22 


LSCQEVKGEIGEAEICWAQWLPPIIPAL 
WGAKVG * S PE VRS S KPA 


11193 


25094 


A 


11290 


305 


497 


KLVNCIYFKGVSFMVRELYCNKIV*WLG 
TVAH I YNPSALGGACGW IT* GQE FETS L 
TNMAKPRL 


11194 


25095 


A 


11291 


478 


1145 


QHVQACPERPQMMGTLERSRAVASKIGH 
SYSLDSQPARAVGKPWPQQACTRVTELT 
EATGKLIRTSHIGKPHPQSFQPAAATQK 
LRPASQQGVQMKTQGGASNPALQIGSHP 
MCKSSQFKSDQSNPSTVKHSQPKPFHSV 
PSQPKSSQTKSCQSQPSQTKPSPCKSTQ 
PKPSQPWPPQSKPSQPRPPQPKSSSTNP 
S Q AKAHHS KAGQ KRGGKH * RANS RD L 


11195 


25096 


A 


11292 


352 


468 


TFPGQHGKI P SLLK I QKLAGHGDACL * S 
RLSPR 


11196 


25097 


A 


11293 


295 


24 


ENIQGSKKNERQHMHNGVGTWNSNPGFV 
ST WVYRNCLACLD ISFSFI KW VKSHKDT 
SMGMFTAALFTIAKT*NQSTCPSMVDWN 
STTRTS 


11197 


25098 


A 


11294 


274 


429 


ECEPVQTLWKIRKFLKKLKIELPSDPAI 
PLKGINLKEVKS VP * SDTYTPMCI 


11198 


25099 


A 


11295 


168 


469 


PSLGNKSETPSPKKKKNLFLGFPYFREK 
LVG I Lli KT VLQKGAGG VF FG P KNP PE KN 
PDPPLDSPPKTFRRLGAFVKTFPKGLKA 
AWFPALV* PQGQMGG 


11199 


25100 


A 


11296 


37 


348 


IKIFNKTLKYFKKKKTIKKKKKKGGGPL 
KRPRGG PR I Y PDGARKNLPH I GTAKKHW | 
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nucleotide 
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ing to first 
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residue of 

peptide 
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Predict- 
ed end 
nucle- 
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location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














GG YWE KJtf PHGGGGNTGPTP P Q KTRP LGG 
KNNLKRVQGEKPRPRLGGGKTRSQLFSC 
HA* P * VQGEKPRPRLGGGK 


11200 


25101 


A 


11297 


93 


388 


DEYASDNCCYFPGSSHDSQVKQIGSQFT 
TQIQAHLNLAKPRRAL*NHS * PMEMRSK 
CLLLAYKKKKKKKKKKKKKKKKKKKKKK 
PKYLS SRGKDHF 


11201 


25102 


A 


11298 


61 


243 


YYIHRTTVFMCMN * GLKDWVDK* TIDLC 
hCKKKKKKKKKKKKKKKKKKKXKKKKKK 
KKKKKKKKPQGGGA 


11202 


25103 


A 


11299 


250 


341 


I YM S LTLGLQNL * E I HLTH KD S HKLQ VK 
GWKNI FHANVNQKQTGVAILISDNIDI K 
ATTVKKKKKRENPTILNIYVPNTGAPKF 
MKHLLLNLKNERDGTTVLG 


11203 


25104 


A 


11300 


3 


285 


DAWE RKKRS QMNQ KTTNKVAGVS PYLS I 
MAWNVNELNFL I KRHRVA* CI KKTKSMI 
C YLQE TN F I KD THRL KI RGW KK I FHPNR 
N*KLAGWY 


11204 


25105 


A 


11301 


175 


407 


AENNPDVLQLKA* TSRGTFMP * ILLRVK 
RGSNS RNNRCWQGWRGTGMLLHC * QECK 
LVQPLWKTVWRILPDLESEIL 


11205 


25106 


A 


11302 


75 


280 


YHLNLVLYPHP PKNRVC* KKKKKKKKKK 
KKKKKKKKKKKKKKKRGGPLKKNLLGGP 
FFPGGKKKNFFF 


11206 


25107 


A 


11303 


299 


3 


IHASFLFRGCITLAFTVRCSHRSITSSE 
TSDLTHYYYYYYYYF*IASQSVALAGVQ 
WDQLGSNHPPTSAS * VAETAGAS HHARL 
IFNFFVEMEVSLYC 


11207 


25108 


A 


11304 


137 


323 


FFTVNGRHRKEKKSNKDQLNLGLHDFLD 
NPLLLF * KYLGRE KKKKKKKKKKKKKKK 
KQKKKK 


11208 


25109 


A 


11305 


64 


481 


RPTRPHVYHI VKP IP* PLTGALS ALLMT 
CGLSM*FHFHSITLLILGLLTNTIVTIYQ 
* WRDETRESTYQGHHTPPVQKGLRYGI I 
LFITSEDLFFAGFF*AFYHSSLTPTPQL 
GGHWPPTGITPLNPLEVPLLNTSVLLA 


11209 


25110 


A 


11306 


142 


330 


FFTVNGRHRKEKKSNKDQLNLGLHDFLD 
NPLLLF* KYLGRE KKKKKKKKKNKKKKK 
KKKKKKN 


11210 


25111 


A 


11307 


259 


440 


RP W I RDNP I Y YLKS CFLRT I FLS LLPLQ 
PRP YPPTRRALS PNRHHPAKS PRS P * PL 
TGALS ALLMTSDLTMGLHFHY I TLL I LG 
LLTNTLT I YQ* WRDATRESTYQGHHTPP 
VHKGLGYGIILFITSKVVFFARFF*AFY 
HSSLDPTPQLGGHWPPTGITPLNPLEVP 
LLNTVRL 


11211 


25112 


A 


11308 


433 


3 


FPPPDIRHSRSLGFPPSFGRAGFLNFAF 
SKGQEPALPGGFYYPSPKKKNLAFCSPP 
LFYWGKFGSKFSLGVGPLFFP**SPKSF 
CQFLENKGFFPEKS FFKFFP PAGGCVFF 
FSEGGFPFCLFLFFFDRVWLCHPGWISV 
ARS 


11212 


25113 


A 


11309 


2 


423 


GRVGDR * LF YTNHMD I GTLYLLFGA* SG 
VLGTAL S LL I RAELGQPGNLLGNDH I YN 
GI VTDHAFD I IFF I AI P 1 1 IGGFGN* LV 
PL I IGAPDMAVAR INNI S V* LLP P SLLL 
LLAYAI VEAGAGTG* TVYPALAGNYSHP 


11213 


25114 


A 


11310 


234 


1 


TFFGKFPFCLKFKPAQRGGENTPGPFPF 
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location 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F— Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














LVFFFFEMESRSVAQTGVQWRDLLGSSN 
S PTS AS * VAGTTGACHHARL I F 


11214 


25115 


A 


11311 


406 


211 


LSLFLHSLISTHGARPTDPVGLVPTQDG 
LDLLTS * SAHIGLPKCWDYRRAPPCPAC 
GFDLHLPND 


11215 


25116 


A 


11312 


379 


398 


KT*NFTVRGS I PLLNNIPMASMTFSKKK 
BCKKKKKKRGGPLKKNPGGAKFNGGRKRK 
IFSLKGGEKKNPGGILEKKPFFGGGKKG 
ENP P KKNKGLRE KKKF* RGKGGKKP Q KG 
GGKKMGSSSAKTPTWG 


11216 


25117 


A 


11313 


390 


489 


RYRLGWAHACNPSTLGSRGRWI I*GQE 
FETSV 


11217 


25118 


A 


11314 


286 


434 


I LWYMNY I S I KLLKI I I YKLGAVAHTCN 
PSTLGGRGGWIP*GWEFETSL 


11218 


25119 


A 


11315 


322 


3 


FTFYGCRQFQVF* I VTKNDRI I FSSYTK 
NLLKI YLGLY* FYYVPLLRTKSLKKTGS 
FESNLSFK*VFLWSGTVAHTCNPSTLGG 
RRGWIA*GQELETSLTNMERPR 


11219 


25120 


A 


11316 


251 


389 


TFFFFEREFHFVPQVGVQGGSLGSLQAP 
P PGSHS LA* AS PRC WDGG 


11220 


25121 


A 


11317 


313 


80 


EYICPRCLSPGFRD* PGQHGETPSLQKL 
QKSAGCGGAHLRSQLLRRLGDRVRHCLK 
KRI YLSLFNSLVLSMSTRFQI I 


11221 


25122 


A 


11318 


145 


410 


APKTLKVALTS AAAARES I CQLGVLDEL 
VIFLELNRSTDQRPGAVAHTCNPRTLGG 
HGGW I T * AQE FKTNIANMVKTC 


11222 


25123 


A 


11319 


393 


3 


NFFKIIFTLSPPLPLIFTPPFFFFFKKI 
PPKPLFFGVFHSQTCPQISFFKNFYPFL 
FWFSNPFFLLFSPSRKGPPPFSRFFFFK 
VGNPLFFLFFFFFFFFFG*EGVSLLLPR 
LECNGTI SAHY YLR PRV 


11223 


25124 


A 


11320 


135 


1 


HRDVKLE YAI RKME VRPGVAABiACNP S T 
LGSRGGWIT*GQEFETS 


11224 


25125 


A 


11321 


282 


2 


NSESVLIKEKGDEEEKVLSFLLFSKRKV 
LESKTFPTLHSGCKKRSYFLKITTSSFL 
* PRVECSGVI IDHCKLKLLGSGDPPTSA 
FPVARTTGM 


11225 


25126 


A 


11322 


149 


30 


WGSGGQVRWLTPIIPAIiWKAEAGGSPEV 
RSS*PVWPCLY 


11226 


25127 


A 


11323 


316 


399 


STLGGRGAQIT*GREFETSLANMPNPIS 


11227 


25128 


A 


11324 


223 


397 


1 1 KAYQ * DVNREQ IKTII I YFQNKLKC I 
KKMIWPGAWHVYNPSTLGG*GGWIT*G 
QE 


11228 


.25129 


A 


11325 


3 


396 


N YSTAAL FLRGH I WGWRWGT L Y YG S S P I 
YCRGLNSTLGLYPLNASSNPHPRKLCKP 
KIYTHIAKCPIGHKIIYPYANTPIH*HH 
THTHTHTHTL *DS LLWPG WAHTCNHNT 
LGSWGRQIT*VQEFKTSLA 


11229 


25130 


A 


11326 


113 


442 


KRYIQIRAHSEVLCGHELWGTLHNPVQT 
IQTWEPFPYLPLCSRVTSGKSHHRFVPL 
L FHL * NRWWS HHHHHHHHHHHHHRWP C * 
ECCRAHSDSYAVWSDKVSQEPVAAM 


11230 


25131 


A 


11327 


145 


3 


ELDIKPPNPFFCHTGVQ*CDLGSPQLIiP 
PGFK* FCLS FPSS WDYRCA 


11231 


25132 


A 


11328 


189 


I 


KTNLLL IHSKX.RLHLQS SVFQC* S KSEK 
SNVCT* CAFSCLPKIiFWLGKVAHACNPS 
TLGGRGG 


11232 


25133 


A 


11329 


193 


405 


ID * QTLTKKKKKKKKKKKKKKKKKKKKK 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine T 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P—Proline, 
Q=Glutamine 3 R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














AYTSKKKT 


11233 


25134 


A 


11330 


209 


11 


GGKRGTPPTINNIFFNFFF*EKGSCCSS 
RAC*CNGEI*AHCPLRIiPGSSNSPAAAS 
RVQETATTPS 


11234 


25135 


A 


11331 


38 


390 


VDQSTVCKMGQSAGCGWGSDKGIKAGCL 
K* QRQKKKKKKKKKKKKKKKKKKKGGGG 
LKKKLFLGPGGEKFFFFFWGPFFFFGGR 
FLKRGGGKKPGYKKNKI SGHPPPFLGGG 
KKKKR 


11235 


25136 


A 


11332 


38 


326 


VDQSTVCKMGQSAGCGWGSDKGIKAGCL 
K* QRPKKKKKKKKKKKKKKKKKKNRGGG 
FIKKIFFGPGGEKIFFFFWGGFFFFGGG 
FKKRGGEEKNLF 


11236 


25137 


A 


11333 


1 


234 


RTRGKNWMAF I KEGWVPTGRMGGSF I KH 
S FLE S KNTWLGKKKKKKKKKKKKKKKKK 
PHDKI KDLFDPV*NKKQlSri SIP 


11237 


25138 


A 


11334 


244 


398 


INHLYSNKNDLIFLLVFYLFFFP*RRPG 
E P P L I KGWL P YLGGVLNLR KD P L 


11238 


25139 


A 


11335 


174 


1 


HQVYPYVKQEHYKQDSCQCYIKYMNVWP 
DAVAHGCNPGTLGAQGGW 1 1 * GQKFERT 
RG 


11239 


25140 


A 


11336 


148 


411 


LFFFPHCYLLNAPSILYLNEPKWYNLH 
LCVS AVCLLVP VHS PHRTCLY* KKKKKK 
KKKKKKKKKKKKKKKKKKKKKKGGAF FK 
KLLF 


11240 


25141 


A 


11337 


237 


453 


GQRPLLSFREKGSFFFLLETEFCFAPQA 
EGQGHNLG*LQLPLWG*SNSLVSSSQEV 
GITGAGHHPRLFFVF 


11241 


25142 


A 


11338 


65 


347 


VCRVDD FVPEARTRF FKS I * EAWNNKNI 
KPLLSTFSQVPGSENEKKCTLDQAFIGI 
LEEEI INHSSCENVLAI ISLAIGGVTEG 
KYGSVLFCLK 


11242 


25143 


A 


11339 


158 


393 


VPCTHSEGIRKGRKCLPKTRENAREERD 
KKIPPGLC*AGKLFPQEKSSTWPGKWH 
AWNPSTLEGRGGWIT*GLEFKTT 


11243 


25144 


A 


11340 


117 


3 


GIRPGAVAHVCNPSPLGG*GRWISRDWE 
L*TSLANIR 


11244 


25145 


A 


11341 


292 


60 


SGQGLRLRGAFRGKVQAKE WMP I PKLGC 
LVKDMKI KSLEE I YFFSP P I K* FE I ID F 
FLGAS FKDEVFFLFVCLFVCF 


11245 


25146 


A 


11342 


149 


30 


TQIFPPPPPPGKKKKFFFPKKKKKKKKK 
ERKKDRKNKQK* KGPGVGVSPLFPPPLG 
GQGGGFPQGLGFKTPLAPKGKPWFP*KT 
QKFPRRGGGPPKIPLFGGGGEKKPFSPG 
GQGSNKPKFFPPPPPRGKKKNFFSPKKK 
KKKKRKKERKTEKTNKK 


U246 


25147 


A 


11343 


120 


504 


G VHDR CL WDDMTRRP PAAAATAC PAGR P 
P PRREEEHSQLLL I S FQGFRWD * DQDVN 
TPNLDHLAREGVKAKYLMPPLVTMTSPS 
HFTAI TGKRHS AHFTRCPSNPQRP S FP V 
IRSKSSVSSREVEVA 


11247 


25148 


A 


11344 


213 


399 


ALVIHCLTTTIQVSGLSGSKYLMRQGMV 
AHGCHPNTLGGRGGWIT*SQESETSLAN 
TVKPCL 


11248 


25149 


A 


11345 


308 


3 


NPTPRSGDLWVEEGSPGAGNWGCLSEQT 
LRAI IKATTSYSFYCYFFETGSHSVAWA 
RVQWYNPGCSICGSSHLPTSAS *GAGTI 
GMHHHAWL I FLFFVENE 
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last amino 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoJeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W==Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


11249 


25150 


A 


11346 


189 


433 


GHCRAGTSLTLTHLGGS AMEL I IHLGLC 
LGRMKFIHTTCI ILFLFFLSATQAGGQG 
GDLS*LQPPPLGFKQFSCFSLPRSWD 


11250 


25151 


A 


11347 


257 


3 


KWGE L C F LT I KE E RMG F P LAKKRKI KGV 
F*FGLFFFFETESHPVAQAHCNLSLLDS 
SYS PAS AS * VAGI TGACHHNQLIFI FYE 


\\25\ 


25152 


A 


11348 


159 


434 


LS PS CRDVGT YSARAHPPHTHTRVRAHT 
HACTHVRAHTHTHTTAHGSHLTKEQPGL 
CTLPGSSW* PGLGGRAIATAQAWVHCAG 
IITAAHPP 


11252 


25153 


A 


11349 


240 


406 


AFKKLAKNNSMGVNRSLSRPGVVAHAWKT 
PSTLGGRGRQIT*GQEFETSLANMVKP 


11253 


25154 


A 


11350 


163 


360 


PQLLQLMKFLQN*IjIjERLRPQNGLNLEG 

lacsesrsrpcapvwrterdsvsspqpr 
pgaggrpsff 


11254 


25155 


A 


11351 


145 


406 


PLNSGAKPTLPPWSPRRPELNAPPPPPG 
EKEFFFFL,GRGFLFAPQEEAGGENIV*G 
KP P P PG * RE FFAP P PGGDGNKGAPP P P P 
LFL 


11255 


25156 


A 


11352 


122 


337 


I LWVLLLVRE S LKF FFFLLLL FFLF FET 
DVA*AAVQ*SDHSSRQP*PPGLiRGSYTS 
AFRI AGAI G 1 CHHL Y 


11256 


25157 


A 


11353 


325 


416 


KCVY I MGWA* WLTP VI PALWEAGAGRS L 
EV 


11257 


25158 


A 


11354 


260 


415 


LEELQNTACQKLE P FLSRTETKQGCLLS 
PRLFNI VLEVIiA* AVRQEKE I KG I 


11258 


25159 


A 


11355 


160 


2 


I SE I STKRNNYFL3CNWLGVVAHACNPS T 
*RG*GGQTTRGQDFQTSLANMVKP 


11259 


25160 


A 


11356 


160 


3 


ENLKLYNGSPFLKI IWLGMWHTCNPST 
LGGQGGRIP*AQEFKTSLGNVRRP 


11260 


25161 


A 


11357 


235 


408 


GQPLPAALFFYQWHLHFKELLAHPWGKT 
VWRFL* PLKKEGPHOPAIPLLDIYPKISTK 
KW 


11261 


25162 


A 


11358 


11 


393 


QTERTPTPKPHLYVSFPPNSQDPQLQGG 
KLTNRKNTHTKT P S VCHHHQ RP KVD KT T 
KI GKKQS RKTGNS KNQSAS P P P KECS S S 
PATEQS WTENDFDELiREEGFR* SNYTLL 
KEEVRTHGKEVKNLE 


11262 


25163 


A 


11359 


283 


475 


FMVS VS I KVYFF I FI FLFFEMES CSVTQ 
AGVQWHDCLSFLC*VLASSPGNSSPVR 


11263 


25164 


A 


11360 


266 


407 


LE IHHS KTL* FWLG I VARACNPS TLGG * 
GRWIT * GQEAE 1 1 LANTVK 


11264 


25165 


A 


11361 


93 


1 


DVLS II»LPRIiECNGAISAH*BTFRIiPGSS 
NSS 


11265 


25166 


A 


11362 


131 


377 


MEEMIKGRGLSKPPALSSYLTLKSS*LK 
RVRARRRL FLKS RFS BDLQVAGFFVCL F 
VFWDRVSLCHPGWSWVQSRLTAALT 


11266 


25167 


A 


11363 


212 


392 


QAEECAPLFIGVKE P PKNLTHWLGTVAH 
TSNPS TLGG* GGQ I TWGQE FETS LANMA 
KPCV 


11267 


25168 


A 


11364 


141 


3 


NELWLNHQNVLE WLGT VAYT CNP S TVGR 
WIT*V*EFETSLGNMVK 


11268 


25169 


A 


11365 


215 


384 


ANFNIRPEISLASKCPSERKSRTLLPLN 
QKhEMl * LSEEATS KAKTG * KLGLLHQ I 


11269 


25170 


A 


11366 


48 


373 


EPWTRKEVLKISSSLKIHCYLSIIiCLIL 
MPTLECTAAIiVIAYSNLELPGTSCPPTCA 
S * VAGTTGACHRAPTVIiKI FLRDR IFFF 
FFTNVTYPWCHTPADPAIFKANAR 
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nucleotide 
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residue of 
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Predict- 
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location 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginme, S— Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, "=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


11270 


25171 


A 


11367 


310 


2 


GLICLNRENAYFFVIRQISMSCLEKEKP 
ARYS S P YAKS VTRGEKRRQRFFLSRAP P 
I PFLFF I * I V* AESRSVAQAGGQWLHFS 
SLQTPP*RRTSDSPASAS 


11271 


25172 


A 


11368 


303 


50 


MFALTNRGVQGS P VI FLGVFLI SKMGFT 
CFKGFSRFKVLVFFLGVFPSFFFFLRAR 
VLLCCPGWSPTLGIK*SSCLGLPQCVLT 


11272 


25173 


A 


11369 


109 


419 


QSNASLSHFTSPPLSVADR*LFSTDHKD 
IGALYLLFGA*AAVLCTAISLFILDWLG 
QPCNLLRNDHIYYVIVTAHAFARIFFIG 
IPIIIGGFGN*LVPIiIIGA 


11273 


25174 


A 


11370 


112 


258 


GGRFKGSNFTSAGMQRNIFFMGPPK*NS 
RAGV* QRGEGKNPGVTKLNRL 


11274 


25175 


A 


11371 


823 


1078 


SQHFGMPRWADHLRSGVQDQPGQHGETL 
SLLKIQKLAGRGGRCL*SQLLRRLRQEN 
CLSLGGRGCSEPRSHHCTPAWATQ*DSV 
S 


11275 


25176 


A 


11372 


184 


420 


I KQQKNLFLALVKELVQLLWKTVWQFLK 
KLNIELLCDPAILLLGIYPRELKMYVHT 
KTCT*MFIVELFIKSKK*KQPTC 


11276 


25177 


A 


11373 


234 


44 


LiATITDRLLALSDSAIPPLGIYAKEKKS 
VYQKDNCTYIFVAALFTVAQI *NQPKCP 
STRTRGS 


11277 


25178 


A 


11374 


73 


456 


PPIGSPLVPYIPSPALSSPQAPRMGSHP 
CLIPSASCHPPLETDFLSLLPTHFCLAV 
FTKERFS PPPPS P YPRF* KFLRS * KFS F 
FLPPFLFFGGTVLLCLSGWSAVAQSWLT 
AASTSLGSSDPPTSTS 


11278 


25179 


A 


11375 


43 


456 


E FFHHVGQDGLDLIiTS * SAHLGLLKCWD 
YRREPPRPASDGHY*TDATGSLPSSGTT 
* IRTKPSQAPASWGLWNLAHHPPRSHPS 
CPMANLICSTLSSFDGGSPGTGPGGWCP 
LGLSGS PARAVFKDSS CSLHPLATGI 


11279 


25180 


A 


11376 


288 


484 


EWVLSAVGGSQYGVCLPFLHCFI FFETE 
SRSVAQDGVQ*GDLGSGSSLPPG 


11280 


25181 


A 


11377 


129 


1 


NFNALNLIRAGAVAHAYNPSHLGGEDGR 
IA*GQEFETSLSNTA 


11281 


25182 


A 


11378 


307 


1 


DSEIPSLGLAKFWNHRREPPCLALFLTL 
NQKLEMIKLSEEGISKAKTR*KAGFLCQ 
RVSQDVNAKEKFLKE I KRATLVKTQMMR 
HONS F I ADAEKVWMAWMK 


11282 


25183 


A 


11379 


200 


397 


CISLFSHYYKDLPETGSFIKKRGLIGSH 
WL,GAVAHACNPSILGGQGRRIT*GQEFE 
TSLASMVKPC 


11283 


25184 


A 


11380 


167 


1 


SNGLYSREARMVQHTHIRPGWAHTCNP 
STLGGHGGRIT*AQELKTSLGNIVRPC 


11284 


25185 


A 


11382 


12 


395 


AHSSFLSLDLSVFATCPHPRAETQTGHR 
FSTLLPLSAIiYPK* D YFKKKKKKKKKKK 
KKKKKKKKKKKKSGGGA 


11285 


25186 


A 


11383 


160 


2 


TMEMMLDIK* I *EI FIiFEFKMGLKAAET 
TQNINNTFGPGTANEQTAQRWFNK 


11286 


25187 


A 


11384 


274 


389 


GQAQWLTPVIPALWETKAGGSLE * GQAQ 
WLTPVI PALWETKAGGSLELKNSRPSLG 
NMVKPP 


11287 


25188 


A 


11385 


389 


37 


WSYESSWSFIFKSFLFSFMISAYDM*YA 
SQILFTPKLYRTHPYFLLVLFLILSILV 
ILETGSRFVSRLECGGGISAYGSFK*LG 
SGNPPAAASL.SSRIPYVHQSCWSDLAFN 
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PEFEM 


11288 


25189 


A 


11386 


214 


374 


KGGGRFKGSKFTSPGGGEKNFFIGAPK* 
NSRAGV* KRREGKNPGVTQLKKGVG 


11289 


25190 


A 


11387 


142 


410 


RFVCS T I KVLRDLS SDRSNPGRFLSTSN 
* KKKI KKKKRGAP FKKKNFFYRGGGGKY 
FFFGAP KINFLGRVFQPGGGENPGRS P F 
KGAWG 


11290 


25191 


A 


11388 


144 


1 


CLSNEDS IKTPKGLGTVARTCNPSTWGG 
RNGW IT* GQEFKTSLANTVK 


11291 


25192 


A 


11389 


167 


2 


RNISSHLLNiVIAITKQTNWlSnSrCW* GYRE I 
ITHIHC*WEFKMMQLLQKLVPQKVKHT 


11292 


25193 


A 


11390 


2 


412 


S DQVQDAGRL I S S EAS LLGLPTATFLLC 
LPTGFALCTRVPGVFLLLRTS VI LREGP 
TLVTSFYLNYLLKDLVSIQSHWGLGLQP 
MNFGETQFSL*QKLV* KEKESMPGMVAH 
ACNPRNLGGQGGL I T * GQE FKT I LA 


11293 


25194 


A 


11391 


1 


206 


TGRLAGHRVSSLP IRGGRAEAPLTSRTG 
RLNQDGLNLLTS *SARLGLPKCWEYRRQ 
PLRLAVISGFLT 


11294 


25195 


A 


11392 


262 


388 


VS K* KKKKKKKKKKKKKKKKKQ 


11295 


25196 


A 


11393 


78 


414 


ICFFLRAPPFFFFFFFF*TKIFFFFPAL 
NSMANFFFFWTPPFLGGIFLPPPFFLEG 
GGPGLASPPGGAFFF 


11296 


25197 


A 


11394 


238 


11 


I KWETAGRS S ACL * S * LFERLRRENPLN 
PGVHG*DEI* *HHCTPAWVTVRPRLSKY 
INKNKK*ARLAARCGGSRL 


11297 


25198 


A 


11395 


340 


1 


KHILVPPFINVSEIQRYSLV*HLLFVGV 
*NLFFLFLSFKKPVLCSM*IICHEIISP 
PSKPKCKASHLSFV*KKNSNLWLGAVAH 
ACNPGTLGARGGR IT * G WE I ETS LTNME 
K 


11298 


25199 


A 


11396 


363 


245 


QMDLNRHLS KDDK* MANQCMNRCLTSLA 
IRQMQIKTIMI 


11299 


25200 


A 


11397 


292 


2 


DPKKEAGRGGAQL*FQLLRRFKHETHLN 
PGG*GCSESRSHSSPPAGATK*NGVSKK 
K*Y*APKEERHPRPVLVEPRVPQVPSPG 
SQTYRQDHSPRV 


11300 


25201 


A 


11398 


121 


321 


KKLYTHKSKSDVMIFLKIETVSWLYCPG 
WRVMAH CDLKLLGS S KP PTS AS * VAET S 
GT CHHARL I FL 


11301 


25202 


A 


11399 


118 


394 


QALPQPIPVLILLQFFFFFF*GTGFCFV 
AQIGGQGGHYG*LEPPIRGLKPSSRRTL 
RGGWD *GGHHNAQL ILGFFEKTGS CF VV 
QAGLKALA 


11302 


25203 


A 


11400 


273 


379 


AE PGWAHS YNPGTLGCRGGG I T * AQE F 
YTSLGNM 


11303 


25204 


A 


11401 


360 


38 


KI FGQARWLTP VI PAFWETE VDGS P EVG 
RSRPA*PS 


11304 


25205 


A 


11402 


254 


404 


G PAGHD FHM I TPS KS TE TGLGAVAHVCN 
P S TLGGRGGW 1 1 * RQDLETS LA 


11305 


25206 


A 


11403 


161 


3 


I LTPTLQYI KINLRGQAWWFTHVI PALW 
KAK*SGGLLELRSLRPAWIEGDL 


11306 


25207 


A 


11404 


1 


170 


MGVSLLSPRLECNGVISAHCNCHLL*FT 
FK*FSCLSLPSSWYRCVPPCLATTFLIY 


11307 


25208 


A 


11405 


249 


3 


NAKSRIQVISIWVFTVKFFHLYYLCNFS 
Q *NEMAGGQS I FRKKHLDFSLEHS GMMT 
THCSLDLPGSSDPPNSASSVGGTRH 


11308 


25209 


A 


11406 


70 


401 


CGKHGVNLI IHSFLNHLKYLLLLQMANL 
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K=Lysine, JL- Leucine, M=Metbionine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














FISQFLLrCVSEPLLF*YCHQIIKVGRL 
HEVKQMRGTKKQEKMFPYWNNDKNVCPG 
VPAHACNPTT VGG * GGWI T * GQE FNT 


11309 


25210 


A 


11407 


257 


389 


GGSLLQKYLRPLVAFLFSFFFFSFETES 
* SWRLE CNGM I SAHC 


11310 


25211 


A 


11408 


364 


420 


SQPFGRPRQADCLSSG*AQWIiTPWPAL 
WEAKAGRLLELRSLR 


11311 


25212 


A 


11409 


312 


413 


TSGMGPGWAHACNPITLGGQGGWI T*G 
REFETS 


11312 


25213 


A 


11410 


283 


425 


HGHKDYVHWLGAVARACGPS I LGGRGMW 
IS*GQEFETSLANMVKPCL 


11313 


25214 


A 


11411 


389 


3 


CR I DGLF FNLVFLRL WWKF * V I FFLS Y 
FS FLVI I LQFLT YMLNCTFF * Q I UNTL FP 
ILVLLTKILNTLLSTENYCHFSLLYCLE 
FWFHPIIKHKMYNLFLI*FILIFILRQS 
LARS PRLKYSGT I S I H 


11314 


25215 


A 


11412 


267 


3 


QEFQQEDTSELDIjEIRSIiCKRKGGLKQG 
DTKDNLS KRWVETDKGKAAWWPGAVTH 
ACNPSTLGGQGGWIT*GQEFETTLTNRC 
PRV 


11315 


25216 


A 


11413 


3 


281 


GAWSCRIPGEGSLSASQAIQPFSLYPVN 
PQ* ISLSARKKKKKKKKKKKKKKKKKKK 
KKKGGGGVLKKI YFPPRGEKKFFFFWGP 
PKKTGGGGF 


11316 


25217 


A 


11414 


275 


2 


KTAHFQLFNSDFKFPGSRNPGQSWSPRA 
KQS FCTQALLKYFLK* MNEMKLNKSRSG 
AVARTCNPSTLGGQGGWIT * GQE FETS L 
ANMAKPR 


11317 


25218 


A 


11415 


319 


1 


RNLGVQVYEYWLVYKYCSVSSNNKMKN 
KRDLLVI*VPGIVIHLIYRLIKENPEIiY 
ISLFIYSMRSLKKLTYRPGAVAHACNPN 
TLGGRGGWIVCSQEFETSLGNM 


11318 


25219 


A 


11416 


121 


428 


LSCDKWGNRDMTKMLSKELKKKKKKKKK 

GGGGALIKNILFPSRAASGFFLSLLKYK 
KLGAAVYI ILGEKT I LWL 


11319 


25220 


A 


11417 


79 


253 


KIFNFQFSNiMLENNSHTTRIKNYFPGRE 
RWLTPVI P PLWEAKAGRSLE VRS * RPS C 

P 


11320 


25221 


A 


11418 


180 


4 


EVGFMKPGLCGVFLEAGSHSVALTIVQW 
CDLGSPQHPPLGSRDPPP*ALQIVGNTR 
RVP 


11321 


25222 


A 


11419 


154 


3 


STLKDKENQKLS IQPDTiyiAHVCNPSTLG 
GRGGWI T * VQEFE I SLNNMVKP 


11322 


25223 


A 


11420 


436 


478 


AI PLPH* RKKILHAALFFFFLKGS FVLS 
PRLECSTKIWAHCSLPPPNISNPPASSS 
QEGGTT 


11323 


25224 


A 


11421 


143 


398 


AAFGFWFVLIRFLRQRLALSPRLDCFFA 
KSASCQPRPP*FKRVCWLSLPKSWDYRF 
DLLCAAGLVWVYLS VSRLHINE KNQ PP P 
P 


11324 


25225 


A 


11422 


1 


138 


MVFHRVGQDGLDLLTS * SAHLGLPKCWD 
YRHE P P RP AAHS YFLLHK 


11325 


25226 


A 


11423 


2 


390 


LELPILLLNTLILFLFLILYIYLVFSPN 
AFRILFITIVLESFSFILLVPSKPWKYS 
L ILSNAF VLCMM I S S P P FPLFS LS TTR I 
NKKLDFWLGAVAHAYNPSTLGSQGGWI I 
*GQEFETSLTNMAKPHL 
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Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartk Acid, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M==Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \-possible 
nucleotide insertion 


11326 


25227 


A 


11424 


267 


414 


VFQITFCIKNFWPGWAHACSPSTLGG* 
GGR I T *AQE FETS LGNMVKP C 


11327 


25228 


A 


11425 


189 


73 


HQDGLDLLTS * STRLGLPKCWDYRHKPP 
RSARPNAWAP 


11328 


25229 


A 


11426 


318 


33 


KKT I S SHARGRKLGS FKTAAP F F FFF * T 
GSCSIiAQAGVKWCSHSSMQPRPPGLK* F 
SASGSAGIIGMSHHTRPIAGFLLAKLKK 
KSNSTGYGGAHV 


11329 


25230 


A 


11427 


291 


393 


DRVSLCLPGWSAVLPS*LTAAQISWAQV 
IFLPQP 


11330 


25231 


A 


11428 


51 


399 


KRVARQNPPNGQGFGWCSCSPILYGQSR 
AQGENIYKLLGSGEWPGQLI IDVEREQL 
EDWRQGDLGRGMWMTKARVHKGPGLVAG 
TCNP S TLGGRGRW I T* GQE FENS LANM V 
KPCF 


11331 


25232 


A 


11429 


242 


420 


VKKKKRGGRFKGSKFTSACLQGISFFKG 
APKLNS RAGF * Q RWEGKNLGVP Q FNR FG 
AHP 


11332 


25233 


A 


11430 


77 


405 


TKRGDSS FCLYRIW I IDSVMKWNQSKAA 
AAAAFFFFLRGTLLFLFRLKGKVGASVI 
*NFCLRGWGGSPASPSKEVGMEGACHNS 
GLFFVFLEKRRFPLVGQEFKTTRAN 


11333 


25234 


A 


11431 


121 


439 


LPLPTSGPEDPLALSFKPTPFPSTLSGM 
IYTGHPSPEHTSHLMPAPPWNNKACRF 
FKT * KKKKKKKKKKKKKKKKKKKKGGPL 
KKKKIFAPPGGEIFFFFWGPKK 


11334 


25235 


A 


11432 


109 


283 


LTPWNLDSRCEIKGNGKPQENHLNLGGG 
GCS E P KSHHC T PARE KRQNS VS KK* INK 
IK 


11335 


25236 


A 


11433 


296 


1 


FLKKPSKGFSKLPHLPQNLNLFFPPK*K 
CPRAKKDIPPPSF*KISSLFGEG*GFTG 
SRPPSLNIiENPPKRIFFFFFFLDRVSHC 
RPGWSAMASSCPRV 


11336 


25237 


A 


11434 


385 


418 


QSETLSQKQKTK*E * SWLGAVAHACNPS 
TSGGRGGWIT*GWEFENSLTNMVKPCL 


11337 


25238 


A 


11435 


3 


405 


LLHQGRLMLGQWGNGRTKTLLMRENVGK 
FPAE PS CSALGVWLQAIACYPGLVRRQ* 
ANHWE KKKKKKKKKKKKKKKKKKKKSGG 
G 


11338 


25239 


A 


11436 


221 


3 


KHEFQTIQMESMK*NTSIKICIYAS*IQ 
DCLRYQWQRPAAVAQAITEHFGRPRQAD 
HLRS G VS DQPGQHGET 


11339 


25240 


A 


11437 


288 


366 


KMFFSWA*WLMPIIPELWEAKAGGSL 


11340 


25241 


A 


11438 


160 


370 


LW YH YFP VRDGLT I LVT YQLKAI RRLLT 
LKSRS GAVAHT CNLNTLGGRGGW I T * GR 
EFETSLANMEKPHL 


11341 


25242 


A 


11439 


230 


404 


MQHGQVG I Y SQGGGWRP VDKKN S QLGT V 
AHT CNL I T LGGRGGW IT* GQ E F KT S LTN 
MV 


11342 


25243 


A 


11440 


302 


54 


LSGVWFLTPPPKGGGFPTP*NGGAPGFF 
PPPPFKNPPPEGKLGGPKKKKNFPPPGG 
EKLVFLKGAPPFFFFFFFFFFFFFFD 


11343 


25244 


A 


11441 


271 ! 


416 


KRTLSTELQYMCCLLI KSKS IFK* KKKK 
KKKKKKKKKKKKKKKKKNPPSLP 


11344 


25245 


A 


11442 


254 j 


392 


SVTGMT IMTSSCC YKI QKCTKI QKLARH 
GGTCL* SQLLRRLRQENS 


11345 


25246 


A 


11443 


390 


150 


LLKNQKTLGGPARNPPPLGGPNGGVPKA 
GNWGPPGPPGGTPFFFKNQKITPGGGGG 
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P * FPPLRGGGPGNPLYPGGQGCH 


11346 


25247 


A 


11444 


122 


1 


PAPLFFFFFFETQSCSVA*AGVQWRNLG 
SLQPPPDDGTRL 


11347 


25248 


A 


11445 


272 


403 


RP KVD KTTKMGRNQ S RKAEKS KNQ S AS S 
P P KDHNS S PAME * S WM 


11348 


25249 


A 


11446 


70 


397 


LCHCAPAWATERDPVLGEKKRTMMSYCH 
P PMRMVE KKI VGD I KC * * G YGE TG F LTH 
C*WECKMVQIL*NIV*QCLKNLNVHSPY 
ELAIVLLGIYPREMKTYIFTKKTSM 


11349 


25250 


A 


11447 


3 


401 


HASAHASAHASAYYRLCAYPKSSFLYPS 
SFLLWSNFPLNILDTTIGPKATETLFI 
KHVMHDVISMQLKSKTFYNYFFLLSLNV 
'FYFVLSCCKRRKRSQLGMVSHICNPSTL 
GGRGRWIT *GQ EFETSLANMA 


11350 


25251 


A 


11448 


333 


1 


KSGCIPPKGQWLFFLQKTARIGSPKPV 
SPPRGKGQFFPQGLRGFY*FLPPPPGVP 
EIFFFFF*DSSHSVTLSPRLEYSGMNTA 
HCSLHLPGSSNPASDSHIAGSTGTCHH 


11351 


25252 


A 


11449 


104 


402 


WCRPLLGGVSQLGYMGVMDPFKEAVCL 
FSELKHCAGRTTALFRAVRQGCLSLKNF 
LLP F FQL C PAHRGGVL * RQ * ALLS CGGL 
HPVRASRPLCLPTQA 


11352 


25253 


A 


11450 


198 


2 


KYKFMAGLGGSCL*SHHFARPRQQDGLS 
PGV*NQPGQHGETPSLQKKVSWAWWHAS 
VEAEAGEQP 


11353 


25254 


A 


11451 


171 


3 


TPPPRATSLFFFF*TESHSVSQECNGTI 
SAHCNLCLPGSSDSPASASRVAGITGA 


11354 


25255 


A 


11452 


177 


461 


PMTGGKYISNSTCSKPNSLSPHLPLKTQ 
TKNMEHFGRLTRGDCLSPGV*DQPGQRG 
KTLLLQKI SQALWDVTAHAWWPGLPTR 
ERRRSLSFPTG 


11355 


25256 


A 


11453 


247 


3 


EGECCGIESR*ILHLKGRICWHINYLKI 
TKI KN * VI L KKM YKQ KLRPGAVAHAYN P 
SALGG*GGRIARAQELiETRLGNIVR 


11356 


25257 


A 


11454 


328 


57 


EVHSNICATLSTHQMKILTKGTRHQHIS 
*GFF*EGEVLTLSTKLECSGVIMAHCNL 
NLGGS SD PLALVS QVAETRGAS YLLLAI 
CPKVNK 


11357 


25258 


A 


11455 


70 


435 


AT RAKLHL KKKKE GR KKE FF PHWQGQP T 
PALVFPQPRTHF I FKVS IFSLKKNREQL 
PYTVQTQSLLPLNSHWGQVA*AGVQWRN 
LRFSCLSLPSSWDY 


11358 


25259 


A 


11456 


112 


401 


KCWDYRRERPCPAPNICIF*LTFFFLGK 
KLNQSFFFFFFLKGSLVLPPRLKGGGAF 
L FN*NLTLQGKNNS PAS PS* RAE TLGLG 
PHAKLWFVFLKK 


11359 


25260 


A 


11457 


198 


3 


EKTSVKQSGDSNKKAWLEMKTNKKSLKR 
* G WAHACNP S TLGGRGGWI S * GRE FE T 
SWTNMEKP 


11360 


25261 


A 


11458 


179 


1 


FFHMYmiVCEYTITLSVNINSQGPGPR 
AHAYNPSTSEDRWI P *AQEFKTSLANMV 
KPH 


11361 


25262 


A 


11459 


210 


5 


RLIHYFFKNNLPSGGGSTA*GQEFEIRL 
DNTVRPHFLKK*KISQWWCMVWSATQ 
EVEVGNGLSPGV 


11362 


25263 


A 


11460 


139 


2 


RI LALS KLFFFFFKQESHFFLPMLE CSG 
VI SAHCNL * LLGSSDLS 


11363 


25264 


A 


11461 


388 


272 


S* KKKKRGGRFKGSMFLSPAGQGNPFFM 
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nucleotide deletion, \=possible 
nucleotide insertion 














GPLKS I SLAW * QRREGKNLGAPH I TRL 
GAHP LFAGWHKTQGAE F 


11364 


25265 


A 


11462 


2 


387 


C VWKSNNKKAP P CVAALRFRWKDDMDEM 
AP G KHWGGLTS KVQS GGLMGR YTG P LT R 
TENS ILLKNPYFEAGHSGSRIi*HQHLGR 
PRWEDYLRPGGQGCSEL*SSHCTFT*VT 
EGDPVS KKLKRTI FGG 


11365 


25266 


A 


11463 


325 


398 


SLPTWAVCAPRPSG*GSGIPELKTMLAG 
VILEDYLDIKNFGAKWGLSCTLATGST 
LFLGKVGPFVHLSVMIAAYLGRVRTTTI 
GEPENKSKQNEM 


11366 


25267 


A 


11464 


174 


2 


YLE I LFFFETGSRS ITQAGVQWSHPGSL 
Q T P P P C P SNS P AS A* VAAHAS ADAWADA 
W 


11367 


25268 


A 


11465 


105 


411 


TDVE LLLMDE * RKWFLELETTPGEDDVN 
IVKMTTKDFGYS INLVDKAVSEFKRIHS 
YFERGSAVGKVLSHSIVC*GEIFHERKS 
PLMQQ I SLLTF ILFYFSE 


11368 


25269 


A 


11466 


291 


404 


SQEGVKIKRQGTVAHTCDPSTLGG*GGR 
IT*GQBFKTS 


11369 


25270 


A 


11467 


64 


236 


THASGLDLLTS * SSCLGLPKCWDYRHEP 
PRPARFSSFYSGSLLNYLAKI IKRDAFC 
I 


11370 


25271 


A 


11468 


278 


416 


NPQISCLKQHVWGAVAHICNPSTSGG*G 
GRI T * GQELKT I L.VNMVK 


11371 


25272 


A 


11469 


178 


325 


RPE VKDQGAS CRDQPHRVGGLLPVCSDE 
RVPQKKKKKKKKKKKKKKKKKSKKKKKK 
KKAS SGGARF * KKKKKKKKKKKKKKKKK 


11372 


25273 


A 


11470 


264 


60 


DINHQSSWLSGRWTPLSWVSISQREDRR 
GFSASIYIYIYIYIYIYIYPPIHIYIHT 
YT * S YHFNQNT 


11373 


25274 


A 


11471 


95 


399 


DFPHliLLiFLPLSQFCTACCPVPENRCFM 
YFFLIFSALCWEMNSRISHSNTARVEQC 
CKVFKILQMKSVLGQVQWLTPVIPALWE 
AEAS * LPEIRSRNQPGQ 


11374 


25275 


A 


11472 


139 


2 


NHVSTEKLSLHTMAHTCNPSTAGGQGGQ 
IT* GQE FKTS LATMAKP 


11375 


25276 


A 


11473 


179 


3 


RARWNPQVFC*GFIiPVFPKILPIWPLRK 
GVPFPPNFFFFFLRVSLCHPHWS AVARS 
RL 


11376 


25277 


A 


11474 


282 


12 


KYIRNE*IVLFVIYFCRNRISWCCPGWS 
* TPELKRSSCLCLPSCWDYRHEPLCPAE 
MNKLDRLNKMDTTKWIQQKKKNHRTQRQ 
DNKII 


11377 


25278 


A 


11475 


145 


3 


VCMQLRTD FQPGAVAHACNP S TLRC * GG 
WIT*GREFKTGLNNMEQPC 


11378 


25279 


A 


11476 


61 


397 


PQTPRLKQS YHVS LPSS WD YS W I YHERL 
VNPGRETDWATCYSGGNIQLQENKLKTP 
TDSTLWVSLCCPRLECSSAIPADCSLNL 
PG*SDPPTSTSHVAETTGACHYAQLIFG 


11379 


25280 


A 


11477 


156 


3 


FIFLKSSQNSYI * KCWPGAVAHACNLDT 
LGG* GGW IT* GHR FKPTRRPRV 


11380 


25281 


A 


11478 


224 


383 


CW VC L I HRAFLAPRP VLG CQ * Q KKKKKK 
KNKKKKKKKKKKDS RGGG 


11381 


25282 


A 


11479 


339 


5S 


NFKNKLCFCASGYLDSKDSQHYI IFFIR 
WSLALLPRLRWEDCLNPGGGGCSEPRSY 
HRTPAW VI E * DP VS KK* IK* NK I K* KQQ 
DWLRLHDKIF 
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residue of 
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Predict- 
ed end 
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correspon 
ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C-Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N— Asparagine, P==Proline, 
Q=Glutamine, R=Arginine, S— Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /—possible 
nucleotide deletion, Y=possible 
nucleotide insertion 


11382 


25283 


A 


11480 


244 


407 


S FGKK I G FL FGP KKFNGGS YGN Y YK I NR 
RFFFPILGEGFSPHVYF*NLETRTTGGG 
LLETTKTTFNLLGRKLGS FLGPRSLTAA 
PTGTTIKLTGGFFFQFWGKVFPRMYIFW 
KDEWKKGF 


11383 


25284 


A 


11482 


308 


30 


SPQKNPPTYSGFKKNLSASPGQKAKPRF 
FLKNQFLPGIMGGGRQFPLLRRVRPPNC 
FNFGGKRCN * QKLGPCP PGGP KETVS KK 
KKKRERQQ 


11384 


25285 


A 


11483 


226 


407 


NHWNKK* FW P KVAHT YNLS ALGGQGGR I 
T * GQE FE TS P SNMRPHAS 


11385 


25286 


A 


11484 


10 


411 


QPPIYFLSLRISLFWVFHINGIIQYVAI 
CVWLVS CHNFSRF I CVAACDQDFIVFLW 
LNNIPLCGYNILCIHSICPNWRTTGLFP 
LLAI I IKL*TFVKKKKKKKKKKKKKKKK 
KR 


11386 


25287 


A 


11485 


184 


1 


TDEELLPMDEQRKWFLEKKPTSGEDAVN 
IFEMTRKDIECYVH*VDKAAAMYERIDS 
NFLSN 


11387 


25288 


A 


11486 


435 


222 


AKNLNRHFSKKVI *MASRHI KRCSI SLT 
IRET*IKSIIR*HVIPGQMTFIQKTGDM 

FGSVS PL.KSHHLKL 


11388 


25289 


A 


11487 


317 


408 


FRPGTVAHACNPSTLGGQGGWIT*GKEF 
KT 


11389 


25290 


A 


11488 


340 


469 


GKGDTKCHLWLGTWHACDPNTLiGGQGR 
RIA*GQEFKTSLSNM 


11390 


25291 


A 


11489 


38 


389 


KRPTSIiKKSNFCSLQGFFFFGLSQKPGK 
SNRS PGKKKNP FFSGGKAP * KGF * KI FL 
S PGLGGS P QKPHLLGGLGE KNYLTLE KG 
GCRD PK*CPCFP P WAGEGNP FS KKKKQK 
FPFFG 


11391 


25292 


A 


11490 


235 


2 


FVLMWLMPKLHIPVTEFRRLFAFPEFL* 
FLFCFVLSYFVLLETGSHYVTYAECSGA 
ISVHCSLNIiPGSSDPPTSVLV 


11392 


25293 


A 


11491 


25 


417 


GTLCLRIGFINISHHCMILKENSLFFFF 
WETKF YFAPQADGRGPNLG * WNP PS PG * 
RGSPGPTSRKRGNGGGGPPGPVIFGFLR 
KNGVPPGGLKGPKSLAPGIGPPGPPKDR 
GLRGGPPAPGLLKKNPKVW 


11393 


25294 


A 


11492 


234 


438 


MWI IAQQCDCTECHY*LKMAKMINFML 
FIF YHTHTHTHTHTHTHTHTRERGPS YI 
YGKVS FKLTHL 


11394 


25295 


A 


11493 


344 


477 


CFTMFI S * IKT* KFNISQLVAHTCNPS I 
LGGQGGR I T * GQE F KT 


11395 


25296 


A 


11494 


35 


482 


GI PGFHHVGQNGLDLLiTS* S IHIiGLiPKC 
WHYRPGPPRPAISITPST*AYLLGNQKI 
GVTHFLAILALFFSGTKAMISPKYVCI I 
FYLLVEEHLGCFYFLAINTATCVCVYKL 
FDTYMYQVESHATHEFISYYFFKILCSL 
SIRLYAVLL 


11396 


25297 


A 


11495 


307 


484 


L S P S ERHE LiKC VL W P GAVAHAYNQ S T VG 
GRGRQIT*GPEF 


11397 


25298 


A 


11496 


487 


349 


FLIKTEFHPIGQEGFDFLTL* FARLGLP 
KGWDYKGEPLRLAFWHPF 


11398 


25299 


A 


11497 


224 


417 


AS GAS ALKVEL CS F S R YK I Y PRHKRR YA 
RTDGKVF*FLiSAKCEGITFFLSFFFFFL 
EVLFCSPG 


11399 


25300 


A 


11498 


17 


411 


KRLVFGGRGRTDRF I S QYV* AE YVS INL 
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Amino acid sequence (A=Alanine 
G=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=Phenyla1anine, 
0=Glycine, H=Histidine, I==Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan } Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possibJe 
nucleotide deletion, V=possible 
nucleotide insertion 














LIFLTKKKKKKKKKKKKKKRGGALKKKP 
WGGQNKTGEKKKNFFFLRGGKKKPWGDF 
KKKN F F WGGEN WG KP P P KKKT P WGKKK I 
FKGKGGKTPPYFCFLKKMG 


11400 


25301 


A 


11499 


117 


370 


VSESADLAWSLGICFFIEFPSDADVAGL 
ETAFFLGGWTGSCSWQPGV*WCNHSSL 
QPRPSGPRQSSCLSLPSSWDYKYMPPCL 


11401 


25302 


A 


11500 


244 


1 


EELFECYSCSPTFSSDPLTTPLLILTT* 
LLPLTIMASQRHLSSEPLSRKKLYLSIL 
ISLQISLIITFTATELIIFYIFFET 


11402 


25303 


A 


11501 


130 


55 


RFVGST I KVLRDLS SDRSNPGRFLS TSN 
S SLKKKKKKGRP F * RDNS AI L F 


11403 


25304 


A 


11502 


186 


421 


SFRLKIAGIIGVHHCTWLWLILNLVLL 
K* ITFALKIMMFSKYSDIGQAQGLMPVI 
PALWEADAGGSLQPRSLSLGNK 


11404 


25305 


A 


11503 


1 


213 


GELLFLYIFA*WLFLLPKLLKFVIVCLC 
ELQFNERFIMVLC I I I ILGVGGFFLFCF 
LL* FCFDFFFFFWRG 


11405 


25306 


A 


11504 


82 


6 


AGVQWPRSWLRLP*LFFTQVLMIFFPPF 
YHQNFFFP*KGFFFLGGLSHFFPPPNKG 
FFFNYPQGFFFSPSLKKKFFFFSSPIFL 
APPGIFL*GPPRFFFFFFFFFF*EGVSL 
CHLGWSAVAAI LAP PAGFTPS S CLSLPS 
S 


11406 


25307 


A 


11505 


245 


440 


WAPCIG*QLLKDPQVLFAGYKVPHPLEH 
Kill RVQTTPD YS PQE AFTNAI TDL I S E 
IiSLLEERFR 


11407 


25308 


A 


11506 


256 


370 


GKLM * GHHVGQAGLKLLTS SD P PALASR 
SAG I TGVNRHAP PRLNVTL YTKGQT I IL 
YHNQYNQ 


11408 


25309 


A 


11507 


1 


149 


GCMWPRTVLFAFQHKFVNFLKTLGDFGQ 
VQWLMRVI PALWEAEAGKLPEARRWRPP 
* PRTVLFAFQHKFVNFLKTLGDFGQVQW 
LMRV I PAL WEAE AGKL PE 


11409 


25310 


A 


11508 


316 


420 


YTY I YE PG * VQ WLT P VI PAL WE AKAGGS 
LEVRSLR 


11410 


25311 


A 


11509 


136 


2 


LDNRLWPGAVAHACNSRTLGDQSGWIT* 
GQEFETSLANMMKPHL 


11411 


25312 


A 


11510 


167 


420 


P T P RS GNTALQ P PHHTHTHTHTHTHTHR 
HTHTHTHTHTRACI * KVRVRALFS PKKK 
PLFCLSVSQIYVGRCLLKRLLFIFLYRH 


11412 


25313 


A 


11511 


124 


415 


IYISVANTIILIFIAIHVLAISFFFFKK 
EVLFLPPRREGGGAF*VN*NLCFtfG*GN 
F PA* P S LKKGI TGALYTPGLF F VFLKKT 
GFRHVGQAWLDFL 


11413 


25314 


A 


11512 


121 


1 


PPVKFWAPFFFFF* IGSPSVAQAGVQWN 
NLSSLQSPPSCL 


11414 


25315 


A 


11513 


245 


13 


VLFILSGFFGPINYPTSPPPQLPFPASG 
NHPSTLYLHEFNCFYFYFLFIYFF*DRV 
SLCRPGWSAVARSQLLVPSDS 


11415 


25316 


A 


11514 


434 


2 


RPPFFPVWGSPPPFFFFFGEGIWGQEER 
KGASSPPWSRGILRGGGPFGGPGAPPT 
PLKKPPLFFFLAPPFLGVFPRFFSFPTV 
FFLPFFVCPKKREALLGVPKGPRPFFFR 
SFLFFFSGD*VSLCRPGWSAVARSQLTE 
SLAS 


11416 


25317 


A 


11515 


247 


390 


QGLRIYSQDFFFFLETKSRFVAQAGEQG 
GNLG*LEPPPPS*SDFMASP 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
TXThreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibie 
nucleotide insertion 


11417 


25318 


A 


11516 


269 


368 


FFFERESCSVTRAGVQWHRLSSL*PLPP 
AFRRF 


11418 


25319 


A 


11517 


256 


404 


NILKI FYLRI *KKKNQLGWAHACNLRT 
LGGRGGWITRGQ* FKTSKEKMVK 


11419 


25320 


A 


11518 


211 


404 


YLCFTRCSVHSHLK* VFLHKCVI *MYRP 
HT*TGL* CMQVHTHTHTHTHAHTHTHTP 
LSQGGDCI 


11420 


25321 


A 


11519 


41 


232 


ESVEPRRRRVPRAQITPLHCSLGDRARP 
CL*SQLLKRLRHENLLNPGGAGCREPRS 
HHCTAAWATEQDPV* KKKKSLNFKTVRA 
INLKMLGALISFHLFLWGIWF 


11421 


25322 


A 


11520 


146 


2 


LiIDQQGVNQLPRMECSGAITAHCCLELP 
GSSDPLTSAS*VAGRPPTRP 


11422 


25323 


A 


11521 


297 


437 


TDHF WNR I LSS CGDT FRD TWLGWPHAY 
NPSTLGGRARRIT*AQEFE 


11423 


25324 


A 


11522 


244 


460 


NCATITTNSRIFLQLQKKHCTKWLSLSI 
S WQSL I SF* * I CFVLFFETNLAVSLRLE 
CSGAILAHCNLCLLSS 


11424 


25325 


A 


11523 


222 


1 


ESKDXiDIFRCQLAPRNSSCFVLFCFVFP 
LEIGSCSVAQAGAQWCNHGSLQPPPPIA 
SAS * VAGTTNVRHYVQLF 


11425 


25326 


A 


11524 


308 


424 


I TFLTMFKI YLQFD PAIPLLGI YPKDY * 
SCCYKDTCTRM 


11426 


25327 


A 


11525 


98 


2 


N I GGR P GAVAHAYNP S TLGGRGGQ I T * G 
QEFE 


11427 


25328 


A 


11526 


113 


399 


LDRFLTLLPRLRREDCLSPGGQGCSEP * 
SHRC VP AWATE * DLLS KKKKKGVLEKP F 
LPPPSVGKPPFSPPERLFFFILRPPLGG 
VFPPPSKKIIS 


11428 


25329 


A 


11527 


280 


3 


P KYCQ I * VLQCIAQ LFKLS I YNT YTHTH 
THSLSLSLSLPPPKVNKWLMSSEFTLTF 
KMYTFGPGRVTHTCNPSTLGG*GGWIT* 
GQEFETNL 


11429 


25330 


A 


11528 


284 


2 


FWGPQKKNFSFPPPGLKIGSFKRAPPFF 
FFFFETKS CS VTQAGVRWCDLGS LQPLA 
WATERD * LNK* I KGMS KKLHT * VCMFRT 
RGRTRGRTRG 


11430 


25331 


A 


11529 


134 


2 


DTLLPRLECNGTITAHCSLKLPGSGDPP 
ASAS * IAGTTGMCSHA 


11431 


25332 


A 


11530 


104 


1 


GRGFFFFFFFFF*EVESCSVAQAGVQWL 
NLGSIQ 


11432 


25333 


A 


11531 


157 


373 


CVFYTYSTSQLRCYI FIGNTRS VFRCHE 
I YT * KNR F WAS T VAHT CNFS T LG S RGKW 
II*G*EFMTSLANWR 


11433 


25334 


A 


11532 


237 


389 


ICKCSKTGFG*DFQEFETSLGNIVRPCF 
YRKI KAS QV * WHAP I VLATW E AE 


11434 


25335 


A 


11533 


259 


122 


HIS PS VQRHFFLF YF * TKPHL VAQAGVQ 
WCNLSSLQSPPPGFKRF 


11435 


25336 


A 


11534 


164 


1 


SSWSQALILLFLKKTGFWPGAVAHACN 
PCTLGGRGERIT*GQKFETSLASINL 


11436 


25337 


A 


11535 


106 


310 


GGDKGTVWDTLHFFDGQKRLDHNKDHWM 
TIHAELPHELSA*CYAFKKECISGIGSI 
* AE KE YK I Q FDG 


11437 


25338 


A 


11536 


167 


378 


ASFFGTTITYKTILVYVYMFEKFIIYIH 
M YTF * RQGLVL S KL ECS NVI I GHCJSTL KH 
LGSSNLPTSASRVA 


11438 


25339 


A 


11537 


101 


396 


VNNLKQR I GLS KRGE KKLLVL I S GFGGG 
FPGAGWI KKGKPPPGFPQGLGP * KRGPP 
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C-Cy stein e, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I— Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \~ possible 
nucleotide insertion 














GKKGNFNKMGFNPKTLVFKNQNFGPGGA 
KPGTPGVRGPLGDL 


11439 


25340 


A 


11538 


18 


361 


QIGPRLY* DQNGET P SLLKI QKKKI PGF 
GGGPLKSQLLGGLGWENH*NPGGGGCRD 
P K* P QGL P P WGTNQN F VKKKEGGRE KGK 
EGRKEGGGEKKKTLPKGIGALNLGGLLT 
SP 


11440 


25341 


A 


11539 


134 


365 


PGFQNKKTGGGGPFRPKTKTNFKVF* KY 
LKICFGFWPKRPPPPGFFILETRSLFFI 
FFLGF FF * ETKSRS VTRLE CSGVILAHC 
NPRLVGSSHYP 


11441 


25342 


A 


11540 


2 


318 


TTERFIYRITFGPKETEVNDLPGAPSEP 
VAALRLEARSFILKL*SFSLVFDFFFLP 
F*KTGS CLSPRLECRGEI ITVHCSRELP 
S*SHPPTSASQSSGSGGFHLG 


11442 


25343 


A 


11541 


294 


409 


GT I S F FRTCRLGMVAHT WNP S TLGGQGG 
*II*GQEFQT 


11443 


25344 


A 


11542 


1 


123 


GKQRQE PALSYDHATVLLTS * S AHLGLP 
KCWDYRSDPPCLA 


11444 


25345 


A 


11543 


69 


350 


VGAKHW I HMDT K I E TAD PGN Y YS GE RGK 
RPRAEKLPVCYYALYLGNGI I CNPKPPC 
CWKGHDFIYIYVCGHIYIHIYICTHIYI 
YFF*DGGLLC 


11445 


25346 


A 


11544 


103 


340 


LFCAFFLSPSKLDFYHLSFYV*TLFLKD 
LISLAFFFLFWEAGS CSVTQAGVQGPNS 
THCNLCLPGSNNPS TSASQAAGT 


11446 


25347 


A 


11545 


256 


354 


PYPFGKPKRGDHWGLGV*NQPGQQGETP 
SLLKI 


11447 


25348 


A 


11546 


196 


335 


E I VARL KRS P VGRAQ WLM P V I P ALW KAK 
ESGSPEVRSS*VKKITCRPGTVAHACNP 
S TLEGQGKWI T * GQE FKTS LANM VK 


11448 


25349 


A 


11547 


202 


377 


T IHTKGVMVGNF I L WFYQNFFKTFF * D 
KILGWARWPMPL I PAFWES EMGGLLE P K 
SL 


11449 


25350 


A 


11548 


396 


26 


P PR I KKE KRS PNGAF YPFP C WNF * KKKN 
LSLAKRFPRDSVFFFVSW*GSYLPTTFL 
QLLRWYVLTSHI AVS FKHLGE PHKNFTL 
QHS I PYLYI I E I EVYS YF I NRNT IYKIN 
MDFS I VGFVFL 


11450 


25351 


A 


11549 


232 


371 


REVPFENIKIGQVQWLTLIIPALWEVKA 
GGSLEARSL* KYQNWPGAVAHAYNPSTL 
GGQGGRIA*GQKFE ISPANMAKP 


11451 


25352 


A 


11550 


174 


381 


NRDEGFR YADRAE VQRLLTGTI LEHCGIi 
QRPGL * RS S CFTLP CGWDYRH* S PSHVA 
GTTGISHHTRLIKKAFFSQTGVSSC 


11452 


25353 


A 


11551 


262 


359 


DYSNLGLVWWLTT VI PALWE AKAGRSSE 
VRSS * PALWEAKAGRS S EVRS S 


11453 


25354 


A 


11552 


70 


380 


LHKCSFIYINHCNNPIYGLISLYPKQLF 
S YENFKDFCNKI * P * I PNFELKTSTP* L 
EKX 1 *NWAMARHGGSRL* SQHFGRPRRAD 
HMRSGVPGQPGQRGESPST 


11454 


25355 


A 


11553 


319 


56 


TFFLGPPGFF*KIFFSVGPPLIFFLPKK 
FIGEQKKNAPGKNFFFGFLFSLFFFFLL 
TLiSSALECSGAMSAHDNLCLPGS SMSPA 
SAF 


11455 


25356 


A 


11554 


230 


399 


MMGE I INQANI I S VCQ I IRRLRHESHL I 
PGGRG YS E LRS YHCT S TWVT E * DS CLICK 


11456 


25357 


A 


11555 


376 


396 


F*IFWRDGVSPCCPGWSQILRLQQSTCL 



1242 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 


SEQ ID 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A=Aianine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=GJutamic Acid, F=PhenyIaIanine, 


sequence 


sequence 




09/515,1 
26 


location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


G=Glycine, HNHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GLPKCWDYRPPHPATAAS CP YFYHYYLL 














KI WRYS FDLS CLWNL VWGLAFG 


11457 


25358 


A 


11556 


176 


384 


KAWP3WRNWYR*NFFFQCCLNNFRKRLK 
CFRCGADKFGKTGFSCLQYALKPTNWGR 
EQWLTVNPTNFGK 


11458 


25359 


A 


11557 


295 


391 


PYKLLITHTHTHTHTHSRTHTI *HTTLS 
LYTF 


11459 


25360 


A 


11558 


1210 


1753 


NTWCMERLAVAFGS FNRNS PLRGWALWL 
TP 1 1 LALWE AEAGGPLE PQE FE TCLGNM 
VKHCLYKNLKNVSGV*DLPGQHGETLSL 
QKFKKCVRRGGTCL*SQRLRRLSWEDCL 
SQGGQGCSEP*SHQCVPAWVTELDPDSK 
KTKT KKF FGL P S P FLF VC FLTH I YVNKE 
YAFVLAEEASGKTTSKLTMVTSRNGIjGK 
TKKFFGL 


11460 


25361 


A 


11559 


172 


3 


KQKFFFPPPSHQIKIFGGGAPEMPFFFF 
FFEKESRFVA*AGVQWRNLSLLHPPPHP 


11461 


25362 


A 


11560 


143 


1 


GQAQWFMPVIPVLWEAKAG* SLEARSSR 
PVCATATPP QL I PK* YEGL 


11462 


25363 


A 


11561 


84 


2 


KPLDTLGGQGGCIT*GQEFETSLANMV 


11463 


25364 


A 


11562 


12 


387 


QGILLPCFSMSSEERGRIYSNTFSFFFF 
FGKGGQINPQGGGQATEPRLREPSPSGL 
TLQGPGNGGPPPPPGQNFFFKKKGGYPG 
GAGGVLN PGP KGNT P PG P P KARKKRGG P 
PGPGQKFL*NPKG 


11464 


25365 


A 


11563 


230 


375 


RAKGKLLSESLLS IQSFKKPSVHTCNPS 
TLGG * GRWIT * GLE FTTSLA 


11465 


25366 


A 


11564 


287 


55 


PPDGRVQWLTPWPGLWEAKEGGSLKSR 
S S * S Q I NPVGSVPYS LRVLR 


11466 


25367 


A 


11565 


115 


411 


FFFFFFFLGENSFFSFRGGPRGGGHIFW 
NFPPPG*GVFPPPPPPGGGNKGGAPPPG 
FFFFFP * KTGFPPLGPGGPPPPLFGPPP 
FPPPKGYNFRRDPPP 


11467 


25368 


A 


11566 


139 


356 


CLLIGWYNTGGITQGSTRKIGTTFFLRQ 
SSSVAQAGVKCSGMILCYRNLRLPGSSN 
LPDSAS * VAG T TAT YP 


11468 


25369 


A 


11567 


101 


410 


FLFFFFFFFLKKIFFFPPRGGGGGGN*K 
KQTPPLGGKKNFPPPPPKRGGGGTPPPP 
P LL FLVF * GE GG F P GGAGGG KKT PKGNP 
PPPPPKGGEMGRGPPPRGG 


11469 


25370 


A 


11568 


417 


56 


EDCLRTGVRDLLPNPHPNLQ I AGRDGAR 
L* SQLQEHHLS PGFRGYSEL * S CHCS PA 
WATE * AYRKQNKTKQNYRS S ALLHL YT P 
MATAWIKIAFLRAGDIFLSVLSFLPNSY 
PLFSHPSQ 


11470 


25371 


A 


11569 


301 


443 


EDTTFVCFEVESCSVAQATDSPASAS* I 
AG 1 1 GVCHHARL I FVFLVE 


11471 


25372 


A 


11570 


44 


408 


RGRGHGPHS P WRRRRRLRWEGRKATE S 
KRRAYKYPLPALLCSHQWKPNEWLDSQI 
KIPTGL * * LK* PS F I FI F I YLFWKQGLS 
LAPMLECRGAVIAH*HLRLPRSSLSHAS 
ASQVAVTTG 


11472 


25373 


A 


11571 


201 


1 


AERP VTGPP VYAEPGLQGRQG* KNMFKN 
MQHPRWADHLRSGVQDQPGQYGETPVST 
KNTKITQAQCI 


11473 


25374 


A 


11572 


343 


1 


FGLLFAFTLEYKLYES INHFAHCCVCRF 
S SGTLHVWCNRHPNKLDE WNSGGGGRQ 
DLTLTPRLGCSGVIMAHCNLVLLGIjSNP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P— Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VSAS *VAGTIGAHSHAWIDKLFFVKTGP 
LY 


11474 


25375 


A 


11573 


209 


3 


VGPPPKSLPGGFPFFFCPKCFPRLKKKK 
KKPGLVAYTCNLNTSGVRGKKIT*AQEF 
KKSPSNIVKPWY 


11475 


25376 


A 


11574 


312 


430 


QRYARPGP WHCYNS STVGGRSRGI V* A 
QEFATSLCNMT 


11476 


25377 


A 


11575 


125 


3 


NS LG P VVHACNPRS LGG * GGR I T S AQE F 
STSLGKIRRPLY 


11477 


25378 


A 


11576 


190 


362 


CVNNYID* EKKKKKKKREREKKKKKKKK 
KKKKKK* GGGAP KKKKKFT PGGGKNF F F 
Li 


11478 


25379 


A 


11577 


231 


340 


KRITRVNQDGLDLLTS*SAPLGLPECWD 
YTMKPCLY 


11479 


25380 


A 


11578 


60 


3 


RMQVS V* I FTPMS LQAYL I YS S S VAAGA 
QSGIEECKYQFAWDRWNCPERALQ 


11480 


25381 


A 


11579 


285 


406 


KKGNW PGAMAHT CUP S TLRGRGGWI A* G 
QEFETNVGDMAK 


11481 


25382 


A 


11580 


121 


1 


NLNCPGPVAHASNPNTIGG* CRPIV*GQ 
EFYTRIANTVKP 


11482 


25383 


A 


11581 


199 


384 


KKTPPGGIFFCPKEAGPAPPL*TPFC*T 
P L F F S PAP FKP KRAL KFFLKP KRG FFQ I 
PFFFF*DGVSLCHPGWSTVARSCPRV 


11483 


25384 


A 


11582 


677 


934 


YGSRHCI CFLQAI S E ILFLKNPARHGGS 
QVQDQPGQQSKTLTLRKKSK*ARHNGS C 
LSSQNFGRLRQEGHLSPGG*GCSEL*PR 
HC 


11484 


25385 


A 


11583 


104 


2 


KKIGGGPPLFFFFETESCSVARLE*SGA 
ISAHCN 


11485 


25386 


A 


11584 


349 


9 


GGRSLGPRKTRVQWGIPEFPPLHLGGKA 
KPCFPKKKKKKVKGLNKHYSKEDIQMVN 
RYMKKCSTSLPRERKIKTIMRYHFPSIR 
MAIIRKIKDPKDQ*GCIEKGGSWRPMY 
L 


11486 


25387 


A 


11585 


163 


1 


TAVRI KHNS YTLTP CLRHSRYL I NVTC W 
P GP VAHACNP S TLGGRGRW IT* GQE F 


11487 


25388 


A 


11586 


192 


41 


S KS K YH I YGDS ALQ KS T VY KW I THFKKG 
*DDVKDKAHSTRLSMSICEEKE 


11488 


25389 


A 


11587 


30 


374 


GWSPEDLFPGSLPPALKGFSGEGG* PFL 
P FQKKRGGGGGS S P P F P VL KRVR PENRL 
YPGGGGFR*PKQGPCPSAWGAEPDSLSK 
KKKGFGQKKKKKKVLKEKT 


11489 


25390 


A 


11588 


66 


245 


SLCFFVFFVFF*TESCSVTQVGV*WYNL 
GS LQ P P P P KKKKN I KKQ KKK INWGRGF K 
RRL 


11490 


25391 


A 


11589 


243 


402 


FLHLKKKKKKKKKKKKKKKKKN* KDAG 


11491 


25392 


A 


11590 


205 


3 


IRLLGI IKNPTPAHCQ*K*KTGEPLW*V 
VW * F FRRLHTELS YDPAI PLLC INS TEV 
WLAPGIPPRFR 


11492 


25393 


A 


11591 


404 


152 


MGFHHVSQDGLDLLTS * STLLSLPKCWD 
*CWDYTREPLGPGHFAISFNKWLNTTT 
VVLYFYLI ILHCIYCPYKLLFLFFFYIY 


11493 


25394 


A 


11592 


311 


386 


TVYPPLAGNLAHAGA* AGI VGTALS ILI 
RAELGQPGALLGDDQ I YWI VTAHAFVI 
IFFIAIPIIIGGFGN*LVPLIIGAPDIA 
FPRINNI S F * LLP P S FLLLLAS S I VEAG 
AGTG * TV Y P P LAGNLAHAGAS VDLT IFF 
LH 



1244 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide - 
location 
correspond 
ingto first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=PhenylaIanine, 
G— Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L==Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11494 


25395 


A 


11593 


39 


397 


EGCPLFFFVFFFFFFFGGKKIFFFPPRG 
GGGAPHK I PG P P P P GGNF F F P P P P PLRG 
GKRGPPPPPGFFFFF*KKGGVSLFPRGG 
VFFFSFWRGGRGGFPPPPPQKKIFLKGP 
PPFYKKK ' 


11495 


25396 


A 


11594 


243 


46 


GILSCCFHDSLSLNSL*CD*VWISLCLS 
YLVSIELLGQCGLIFFIKRESFWLLLLK 
MYLQPLSFLS 


11496 


25397 


A 


11595 


327 


2 


PRAILRIQFLLWPMRPYVMWSQPCSLCF 
CHTAKTVPTSRPLLLGFFSLPGIAHISM 
WLVLSYGLGFSSKVTLWPGWAHVCNPS 
TLGG * GGW IT* GWE FKTSLANMVK 


11497 


25398 


& 


11596 


292 


384 


LGTVAHTCNPSTLGGQDGWIT* IQEFET 
SLR 


11498 


25399 


A 


11598 


125 


2 


P FQGQRS GGTRGPGAVAHT YNP S TLEG * 
GGR IT* GQEFE TS 


11499 


25400 


A 


11599 


302 


408 


TGGWLG WAHACNP S I LGGRGRQ I T *GQ 
ELETRLA 


11500 


25401 


A 


11600 


409 


131 


GRSGGVPKSGVLKPPGPPGGTPFFFKKP 
KITRGGGQPPLFPLLQRVRPENCLSLGS 
GGCH*TKLPPCPSTWGAQ*KLVSKKKKC 
EGGGRIYS 


11501 


25402 


A 


11601 


196 


330 


HHCMLAWVTGRPYLNIYICIYADIYIHI 
YICIYIYTHTRVYIYLLAI* IYIYVYMR 
IYIYIYIYVYIYTHTHGYIYIY 


11502 


25403 


A 


11602 


41 


252 


TPGRARGQRLiGHSSTGLDLLTS * SACLS 
LPKCWDYRREPPRPAEDIKFMF*AILSE 
TILVRGVRSRATPS 


11503 


25404 


A 


11603 


271 


440 


KKKKRGGRFKESKFTSAGLQGNI FFIGP 

PKLI praav* qrgdwknlg vtqlnrlch 


11504 


25405 


A 


11604 


429 


2 


FS S FFAEKGCHCVS QAGFKLLS SRDLPT 
SAPHLGIRLLGRLRQEDPPSPGV*GCSE 
LQSRLCTPAWITKRDP VS * KFKKQKTKT 
HTH*RKPTKGQDHQYHCLPPPHVMPLKV 
FRGSNVHGAVRTRGSRGSIPELSHAVPH 
AP 


11505 


25406 


A 


11605 


113 


2 


TEATHLVCTYKN* KLGWAWW I MP VI PAL 
WEAEVGRLL 


11506 


25407 


A 


11606 


301 


3 


W VM VC S VKN I ALGHACG KFH YE GRGKP A 
EHPSWVLLVLESLQKKLFLSYFLFTFLL 
QPPSPRLT* P VFR FF FFRDR I ML CCPGW 
S AWPERVGRAGNS E 


11507 


25408 


A 


11607 


117 


431 


SCDFCLSFSLSLFFFFFRKGIFFLGPRG 
ENKGGNKIKWTLGPGGKGNSRPSPPKGG 
GKRGGPPPGANLDF*KKTGFPQGGRGGE 
KPPPPGGRAGGAPQRGGKPAG 


11508 


25409 


A 


11608 


277 


88 


RRTTSSWLARNCIQGPAGYHTGS*TPDL 
KG S ACL S F P KCWD YRWRLDL 


11509 


25410 


A 


11609 


200 


27 


VRLKPGDSRDPDMQWYYI*ENRPARCGG 
S LL * S * H FGRKRKADHLRLG VQD TRS S A 
R 


11510 


25411 


A 


11610 


17 


414 


TVWLVMQDPEMEWIVSGWGGKEEPGRYF 
MCEARSSSMDRVFFFF*KKTLFFPPGGG 
GGAKIWVNG I PPPQGS PLSVAYPLGVWE 
KKGPPIHPHKKRVPPV* PGGVLFSEPGN 
PPPGPPKGVGKRSSPPSPVL 


11511 


25412 


A 


11611 


296 


428 


R I YENWRGT VAHACN P S TLGGQGGW IT* 
GQSLKTSTGTVAKP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine ? H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Ty rosin e, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11512 


25413 


A 


11612 


20 


362 


NF VRGVARGS RGGATRCEWVGLRP VGMA 
YGRSPSWLL*VKPSAASHRPPLPRAADT 
PGTAP AP TPTPAPAAAPALTP S SGPGS A 
ALTLEEELQEAIRRAQRWGLTMLPRLAS 
NS 


11513 


25414 


A 


11613 


158 


2 


LMLSRTPLQQL*YTLLELELPRLIAPDL 
LSNG SSL KDLKWTH S N YRAS KE TK 


11514 


25415 


A 


11614 


3 


410 


RSEVYGLVLTTWYQVNRVPWTHKAARFT 
RVSRWARPRSKISESCYWLPQAHRTKS* 
PLVRTLSGLCSLLGFHPLLFSLLATDLV 
PLPSFALLLLLELFFLPGMPKLAP*SPT 
KRSFSFYFFRDRVLLCCPGWSQTP 


11515 


25416 


A 


11615 


189 


397 


YKVLFLVEKNGFFFLFFETKPCCVAPTG 
VQG PNFG * LQAP P P GL P L F S ALAAP E VG 
ITNGTPPPPRFFF 


11516 


25417 


A 


11616 


158 


1 


KGKPTPKTFFCQKQIFFLVFFF*TESHS 
VARAGVQWRDLDS LQAP PPGFTPR 


11517 


25418 


A 


11617 


319 


425 


KVWLYSQAV* KVWL YSQAVAHTYNPNTL 
GGQGWRIA*AYEFKTSL 


11518 


25419 


A 


11618 


94 


2 


CLT I S WLGTVAHVCNPS TLGG * GGH I TR 
SR 


11519 


25420 


A 


11619 


381 


9 


FFPLPLPLPTVSLFPRSPSDAEPKLDCT 
AAISAHCNLPA* FSCLSLPSACNCRRAP 
PRLTASASRGAGIADGVSFTQCSMVPRL 
ECSGVISAHYNLHLPATSLGLPKCRDCS 
LCPDEIVDPRI 


13520 


25421 


A 


11620 


344 


431 


I VNVI VGKGEKGRKI PE * LIR CNGWN * S 
WDRWVAE DHVLRE P S ENRRLAR VRCL T P 
VSQHFGRRRRLDHEVRSLRPACATWRNS 
VSTYNTKSGWAC 


11521 


25422 


A 


11621 


174 


449 


VFSLPLLGGASQLGYLEVRDPLEEAVCP 
FSDLKPHAGRTTTLFKAIRQGHLSLQRF 
LLP FVWLCPAPRGGVYRGRQAS LS * GGL 
HPVRASRP 


11522 


25423 


A 


11622 


316 


412 


GGP S AVAHACNP S ALGGRGGR IS* GQE F 
KTSL 


11523 


25424 


A 


11623 


183 


2 


P KQ V YLRNGL L SH S KCNK I HH I NRL KRE 
KIQ PG WAHAYNADTTLGGQGRW IT* GQE 
FENE 


11524 


25425 


A 


11624 


355 


168 


DLL I VC YDPNAINL * SLLLRRLKWED CL 
SPGG*GYSEPRSCHCSPAWAAEPDPVSK 
KDRNVEFIYCC 


11525 


25426 


A 


11625 


154 


3 


TADVLFLGQAGRGNFWMKKNCRPGTVA 
HACNPSTLGGRGG* ITRGLEFK 


11526 


25427 


A 


11626 


412 


3 


I FFPPPKKDKRTRGGQKKNI PPAPPRNF 
LGRGGNFPPPQKPFFPPLF*IFPRPP*C 
KGSIPPFLFPLCFQKKKTPSFFFLPNGS 
PPICEKIFLFPKKGFLKFKKPRFLKKKG 
P FF F F FGNAVLL CQ PGWSAVAQ S Q 


11527 


25428 


A 


11627 


290 


423 


VFEYFFKFFKKNNSWPGWAHTYNPSSL 
GGQGGW IT* SQVLRP P 


11528 


25429 


A 


11628 


213 


3 


LS G I YS GNRRLVRYS KS VS VIHH INRL K 
KKNHI I ISIDGEKPLEQIQNSFMT*KRN 
KLKWEDHLRPADA 


11529 


25430 


A 


11629 


152 


2 


FQKNRDLE P SKAGWPGMVAHACNP STLG 
ARGGQ I T * GQEFETSMVNMVKP 


11530 


25431 


A 


11630 


422 


189 


PLSGFCPSTWGAKQRALSKKKKRSEAGH 
SGSCL*SQYLGRLKQKDHPSPGGRGFSE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN • 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=GIycine, H=Histidine, I=Iso leucine, 
K=0Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LLNCT P AWVAKQD P VP KKRLK 


11531 


25432 


A 


11631 


239 


424 


KEDNNRVFVIGPLGGLNELAHDKAFKTQ 
NHHWLGVGAHACNPSTLGGQGRWTT * GQ 
EFTTSL 


11532 


25433 


A 


11632 


343 


401 


AREREHKSAACQ * GMERSVIKHCHVATE 
*EREQKRERQQERASEKERQQESERDSK 
RERARE S ERERES TRAQLVRK I VNQ I S 


11533 


25434 


A 


11633 


125 


371 


FQG CAKI CLTNTLLP YAHGYL I WRHDGK 
GCHEKMTAKI IHS CKLSPDAL* SQLGVP 
L * THTHTHTHTHTHTHTFS E KKGLVP 


11534 


25435 


A 


11634 


235 


433 


AALTSFSGPFSFCSYLINTLGKNFPNHE 
I S TT * FTLQRL * K 1 1 F WPGM VAHACNP S 
TFGGQGRWIT 


11535 


25436 


A 


11635 


93 


385 


FTLKS ERRKHS VPQGKKQNKI KLLS PDC 
KLENKRFGWARWLTPVI PAL * E AKAGE S 
PEGNGEREEATEPFGSIILGFFFCTQIA 
HLHRSDYEPNEDS 


11536 


25437 


A 


11636 


190 


398 


NTKKSQKAFFKTQGKPPPLDPKKKASKK 
TKF FLP KKI S trkfrtgp wrg tnfs lkk 

AKKNLGKTRKKI F P * RP P F PS I S SLFWP 
FFFFLFF*KDKVPLCHRGWSAWQSQLT 
ATSISQMY 


11537 


25438 


A 


11637 


190 


380 


TPKKWPKAFFKTKGKPSPFHPRKKALRK 
TKFFLPQKFP*NPLSPLIPALFWPFFFF 
FF F * KD KVP L CHRGWSAWQ S Q LTATS I 
SQMY 


11538 


25439 


A 


11638 


261 


91 


EKTLWAGHGGTCL * SQLLGRLRWENHLN 
LGGRG*RSRHCTPAWVTERDSVSNK* IN 
K 


11539 


25440 


A 


11639 


1 


334 


FRANRTVKDAHS IHGTNPQ YLVEKI IRT 
RIYESKYWKEECFGLTAELVVDKAMELR 
FVGGVYGGNIKPTPFLCLTLRMLQIQPE 
KDIIVEFIKNEDFK*VQCSLANIRGMY 


11540 


25441 


A 


11640 


193 


457 


DGILLCGPGWSAVAPSQLCSGTISAYCJST 
LCL PGS S ES PVS AP * YLGLQ AC AHAPLM 
FVFLVYTGTCMLPLSAHHSRAANSSHFS 
YPLL 


11541 


25442 


A 


11641 


257 


373 


NTISCFRSLRPVAHTCNPSTLGG*DGQI 
T*AQEFETCLG 


11542 


25443 


A 


11642 


211 


367 


DTTSHPFEWL*FKKSRK*RVLDRIWGN* 
MliVH C W WEHKM VQ LL * KT VWWL F K 


11543 


25444 


A 


11643 


112 


367 


LQ I KT X R YHH I LTRMAK I QNTD TNKC YQ 
E CGATGTL I HC WWE FKI I P VI PALWE AE 
VGGLLEVRSSRPAWPTRENPIFTKIIKLT 
Y*KYKNTPYIIR*LQIKTIRYHHILTRM 
AKI QNTDTNKC YQE CGATGTL IHCWWE F 
KIIPVIPALWEAEVGGLLEVRSSRPAWP 
TRENP I FTKI I KNY 


11544 


25445 


A 


11644 


310 


335 


YHLIAGSLGDIERLSML* IGKAVCHNKY 
* *GCGEPGFLLHCWWEWKMVQSL*KIVC 
QILQT 


11545 


25446 


A 


11645 


143 


455 


KRKGP S LWDLGKSMGGI RF F * KKVFFSG 
FKSLWFFFWPLWKPLSRLN*IKISFYPP 
AKKKGGQRKAQFFFFFFPRSFSLAGCSG 
S ISAHCNLCLLGLSDSPTS 


11546 


25447 


A 


11646 


182 


458 


SRGGIRFF*KRVFFSVIKSIWFFFWPLL 
KPLGRVN* IKISFNPPAKKRGWERKAQF 
FFFFLPRSFSLAGCSGSISAHCNLCLLG 
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SEQ ID J 


M 


SEQ ID 


Predicted 


Predict- 


Amino acid sequence (A— Alanine 


NO: of 


NO: of 


eth 


NO: in 


beginning 


ed end 


C=Cysteine, D=Aspartic Acid, 


nucleotide 


peptide 


od 


USSN 


nucleotide 


nucle- 


E=Glutamic Acid, F=PhenylaIanine, 


sequence 


sequence 




09/515,1 


location 


otide 


G=Glycine, H=Histidine, I=Xso!eucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LSDSPTS 


11547 


25448 


A 


11647 


319 


3 


R S TKYS PRGWE P S MWFL FLHVF S L P FRS 
ILGPPLLGTPLRGSPPPPPPFEGKGFIY 
GLTFFFP*EPPCEIFFFFFLDRVLLYCP 
GWS AWQSAHCS LELKDKVC I 


11548 


25449 


A 


11648 


165 


2 


DQSGRPRKI LCAHTHTHAHMHTS LGQT * 
QD P V * THTHTHTHTHT P L YAAAQR I N 


11549 


25450 


A 


11649 


290 


3 


S KQNTTH WATLKS Y IYPSQFWRLGS PS S 
RWQLDGFL VRAL FLVCRQP PSLRI LTWW 
EKTEERKREREK* DRERERNE I ERNERE 
RERERETQRMY 


11550 


25451 


A 


11650 


220 


2 


TPTLPKGTGLTALDRI PGRDPGPRRAGR 
PF*AGVAAENCPDPAEGGSTRPLGAPGR 
GSAPP *RHPGPAGGRR 


11551 


25452 


A 


11651 


29 


387 


FLIFDVTIVIDLACHEPHSYKMAHLINV 
VCVLTAPPTSYFPISFLGPPYSLRHNDI 
EIRPINUPTRTSQCS'SERKSHTSLTLNQ 
KLEMTTLS KDGMS KAKTG* E PGL * C* TV 
SQAVNAK 


11552 


25453 


A 


11652 


267 


2 


KFFFPVFSTSKLNFYPGTPFPLFLFSFF 
F F F F F F FMREG FAVT W VPGAVAP GAVLA 
HCS LE LVG S GKP P I S AS * GAGT TGMHYH 
TRMY 


11553 


25454 


A 


11653 


50 


411 


E AVAGGME KS MMNL P KG PDT L C FD KDE F 
MNED FD VDH F VS D CRNRGQ LEE LRDNP Q 
LYYKLLKTAMVELINKDYADFVNLSTDL 
VGMEQALNQLSVPL * QLREE VLS L * S FV 
IERISPSS 


11554 


25455 


A 


11654 


303 


403 


VRNMVE WLGVAAHACNP STLGGQGEQI T 
*GQEF 


11555 


25456 


A 


11655 


356 


653 


SVSPCPFPGPASTLSFLVADFRRRGVDV 
S QVAWQS KGDTPS S CC I INNSNGNRTI V 
LHDT * G PRAS P CYNP P I S WLNQF PHS P P 
WASQL P P S VAS VYQK 


11556 


25457 


A 


11656 


442 


2 


GRP I LMRPAVL *SEPRT PACVD I QQQIM 
T I IDE WKACAKVQTLSAP INSASRMQS 
IRHWYILKDSSARPAGKGAI IGFIKVG 
YKKLF VLDDREAHNE VE PLC I LD FYIHE 
SVQRHGHGRELFQYMLQKERVEPHQLAI 
DRPSLV 


11557 


25458 


A 


11657 


109 


474 


D INMCGRWLVAERM* KRKRDKYNYEET 
E * VL * KKVRHSQLQKENVAQNAP WQNA 
AYIDQPS PAHVGQQGLS KLPSRPGDQGV 
EPQNLRTLQGRSVIRSATNTTLPHMLMS 
QRVIAPNPAQLQGQRGPPKHGICRTTTP 
NMNPANV 


11558 


25459 


A 


11658 


342 


3 


ENQLKNGEHSKSQSASSPPNDHNTSLAR 
A* KWAGAEAE IALLTEAGFGRWVIMNFA 
ELKEHWTQCKEAKVHDKTTQEL I AGFE 
RNITYLM*LKITTREPHIAITSINSRMA 


11559 


25460 


A 


11659 


204 


17 


STCTPLPTA*SQSRPHPPPTPTPPAPWR 
SSNGPLSRGSSSRNSTSSWSIALATGPW 
THRSEL 


11560 


25461 


A 


11660 


505 


514 


GS*GNHFNPDGASHGGPQDSDRTGPRGP 
CLPS PWGAQ I LRDW YPQEEMGTLGKQ 


11561 


25462 


A 


11661 


80 


374 


FLFDI YNRCYLI F INT I * YLSSNFD* I F 
I IIYLVAI YRLLCLGLFRAPRQCKHPRP 
Q F S FNK I GT KR * AW P GAMABT CNPNTLG 
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